1 Summary of things you should already know 



"I think I can safely say that nobody understands quantum mechanics" Richard Feynman 

1.0 Prerequisite 

All material covered in Junior Honours Quantum Mechanics is part of the syllabus of this course. 

1.1 Understanding Nature 

Quantum Theory encompasses our best understanding of how nature works: what will be the result
of any experiment. We arbitrarily split the universe into a "system" (wavefunction), an environment
or measurement (Hamiltonian, or operator) and a measurable quantity (eigenvalue). There is
no unique associated mathematics, but since all measurements on systems yield real numbers, we
need mathematics which gives real eigenvalues. So it is a premise of quantum theory that any
measurable quantity is associated with a Hermitian operator.

1.2 Matrix and operator mechanics 

There are two equivalent mathematical ways of calculating physical properties, Schroedinger's
wave mechanics and Heisenberg's matrix mechanics. In each, systems are represented in terms of
eigenstates and measurables as eigenvalues. In matrix mechanics the operator is represented by
a Hermitian matrix of elements $\langle m|Q|n\rangle$ which depends on the choice of basis set $|m\rangle$. Any state
can be represented by a normalised vector, which also depends on the basis set. The eigenvalues and
eigenvectors of the matrix, however, do not depend on the choice of basis - the eigenvectors are,
in fact, the eigenbasis of the operator.

For a set of basis 'vectors' of size N, there are N x N possible matrix elements. 

1.3 Operators and Observables 

In addition to position, a full description of a system must contain some implicit information. The 
abstract bra-ket notation includes this. 

Consider the electric charge. Obviously this is measurable, so it should be associated with an
operator $Q$, such that e.g.

$$Q|\phi\rangle = -e|\phi\rangle$$

where $\phi$ is the wavefunction of an electron. $-e$ meets all the criteria for a quantum number, and
the above equation is obviously a true representation of reality. Thus the meaning of the ket $|\phi\rangle$ is
broader than a simple spatial function, and operators can also be non-algebraic. This is especially
important in particle physics where all manner of quantum numbers appear (isospin, strangeness,
baryon number etc.).

1.4 Changes in time 

Schroedinger's equation $H\phi = i\hbar\,\partial\phi/\partial t$ shows us that the Hamiltonian (energy operator) is related
to the change in wavefunction in time. A system prepared in an eigenstate of the Hamiltonian
has time-invariant probability density. A system prepared in an eigenstate of an operator which
does not commute with the Hamiltonian has a probability density which varies in time. It is this
time independence (conservation law) which makes eigenstates of the energy operator so useful.



When we measure some property of a system, the act of making the measurement collapses the 
system into an eigenstate of the appropriate operator. All memory of the previous state of the 
system is lost in this collapse, except in the special case when the state is degenerate, as we'll see 
later. The system then evolves according to its Hamiltonian. 



1.5 Formal definition of a complete, orthonormal basis set 

Consider a basis set $|i_n\rangle$. It is orthonormal if $\langle i_n|i_m\rangle = \delta_{mn}$. It is complete if any wavefunction
can be written as $|\phi\rangle = \sum_n c_n|i_n\rangle$ and the $c_n$ are uniquely defined. If the wavefunction cannot be
so written, the basis set is incomplete; if there exists more than one possible set of $c_n$, the basis
set is overcomplete. Choosing a basis set in a Hilbert space (see 1.7) is analogous to choosing a
set of coordinates in a vector space. Note that completeness and orthonormality are well defined
concepts for both vector spaces and function spaces.



1.6 Example of matrix representation method and choice of basis 

In practical quantum problems, we almost always describe the state of the system in terms of
some basis set. Consider a simple spin 1/2 system, choosing as basis states $S_z = \pm\frac{1}{2}$. Consider
this system in a magnetic field pointing in the x direction; the operator corresponding to this is
$\mu B S_x$. We wish to find the eigenstates and eigenenergies.

Evaluating the required matrix elements such as $\langle S_z = \pm\frac{1}{2}|\mu B S_x|S_z = \pm\frac{1}{2}\rangle$ (see QP3) gives a matrix:

$$\begin{pmatrix} 0 & \mu B/2 \\ \mu B/2 & 0 \end{pmatrix}$$

The normalised eigenvectors of this matrix are $(\sqrt{1/2}, \sqrt{1/2})$ and $(\sqrt{1/2}, -\sqrt{1/2})$ with eigenvalues $(\mu B/2)$
and $(-\mu B/2)$. Of course these represent the eigenstates $|S_x = \pm\frac{1}{2}\rangle$ in the basis of $|S_z = \pm\frac{1}{2}\rangle$:

$$|S_x = \pm\tfrac{1}{2}\rangle = \left(|S_z = \tfrac{1}{2}\rangle \pm |S_z = -\tfrac{1}{2}\rangle\right)/\sqrt{2}$$



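Had we chosen $|S_y = \pm\frac{1}{2}\rangle$ as our basis set, then the matrix would have been:

$$\begin{pmatrix} 0 & -i\mu B/2 \\ i\mu B/2 & 0 \end{pmatrix}$$

Once again, the eigenvalues of this matrix are $(\mu B/2)$ and $(-\mu B/2)$, as they must be since these
are the measurable quantities. The normalised eigenvectors in this basis set are $(\sqrt{1/2}, i\sqrt{1/2})$
and $(\sqrt{1/2}, -i\sqrt{1/2})$: the components have the same magnitudes as before, but now carry a
relative phase of $\pm i$.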

Had we chosen $|S_x = \pm\frac{1}{2}\rangle$ as our basis set in the first place, the problem would have been much
simplified. The matrix would then be:

$$\begin{pmatrix} \mu B/2 & 0 \\ 0 & -\mu B/2 \end{pmatrix}$$

Once again, the eigenvalues of this matrix are $(\mu B/2)$ and $(-\mu B/2)$, and now the eigenvectors
are (1,0) and (0,1): i.e. the eigenstates are simply the basis states.
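The basis independence of the eigenvalues can be checked numerically. The following is a minimal sketch, assuming NumPy is available and working in units where the product $\mu B$ equals 1; the names `mu_times_B`, `H_in_Sz`, `H_in_Sy`, `H_in_Sx` and `spectrum` are illustrative choices, not part of the notes.

```python
import numpy as np

mu_times_B = 1.0  # the product mu*B in arbitrary energy units (illustrative value)

# The operator mu*B*S_x written in the three bases discussed above
H_in_Sz = 0.5 * mu_times_B * np.array([[0, 1], [1, 0]], dtype=complex)
H_in_Sy = 0.5 * mu_times_B * np.array([[0, -1j], [1j, 0]], dtype=complex)
H_in_Sx = 0.5 * mu_times_B * np.array([[1, 0], [0, -1]], dtype=complex)


def spectrum(matrix):
    """Return eigenvalues (ascending) and column eigenvectors of a Hermitian matrix."""
    return np.linalg.eigh(matrix)


results = {
    "S_z": spectrum(H_in_Sz),
    "S_y": spectrum(H_in_Sy),
    "S_x": spectrum(H_in_Sx),
}

for name, (energies, vectors) in results.items():
    print(name, "basis: eigenvalues", energies)
    print("  magnitudes of eigenvector components:")
    print(np.abs(vectors))
```

The eigenvalues agree in all three representations, while the eigenvector components differ, as expected for the same states expressed in different bases.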



1.7 Dirac Notation - Analogies with vectors and matrices 



You probably remember Dirac notation as a shorthand for integrals, for example the overlap
between two wavefunctions can be written as:

$\langle\chi|\phi\rangle$ instead of $\iiint \chi^*(\mathbf{r})\phi(\mathbf{r})\,d^3r$.

(Where $d^3r$ is the scalar volume element, sometimes called $r^2\sin\theta\,d\theta\,d\phi\,dr$, $dx\,dy\,dz$, $dV$ or $d\tau$)

But also if we have a complete set of orthonormal basis states $i$, the overlap is also the sum of the
overlaps between each $i$ and $\chi$ and $\phi$:

$$\langle\chi|\phi\rangle = \sum_i \langle\chi|i\rangle\langle i|\phi\rangle$$

Warning: A summation convention is also sometimes used, such that when a state symbol appears
twice, first as a ket, then as a bra, it is assumed to be summed over a complete set of orthonormal
basis states. The expression above is then further abbreviated to $\langle\chi|i\rangle\langle i|\phi\rangle$. This convention can
be confusing and will not be used in these notes.

Compare this with the vector dot product formula

$$\mathbf{b}\cdot\mathbf{a} = b_x a_x + b_y a_y + b_z a_z = \sum_i (\mathbf{b}\cdot\mathbf{e}_i)(\mathbf{e}_i\cdot\mathbf{a})$$

where $\mathbf{e}_i$ are the unit vectors in x, y and z directions. Just as any vector can be expressed as
a linear combination of $\mathbf{e}_i$, so any quantum state can be expressed as a linear combination of
basis states $i$. There are certain conditions on the basis states, e.g. they must be 'orthonormal':
$\langle i|j\rangle = \delta_{ij}$ just as $\mathbf{e}_i\cdot\mathbf{e}_j = \delta_{ij}$. Just as the three Cartesian vectors span a three dimensional space,
so the many basis states span a many-dimensional space. In some cases (e.g. Fourier expansions,
hydrogen wavefunctions) there are an infinite number of basis states, which therefore span
an infinite-dimensional space. Mathematicians call these 'Hilbert spaces'. Any state
can thus be viewed as a vector in a multi-dimensional space, where each dimension corresponds
to one of the basis functions. It is thus common to use the words eigenstate and eigenvector
interchangeably to refer to $|\phi\rangle$. Even before the discovery of quantum mechanics, mathematicians
had solved many of the problems in this 

In Dirac notation we have two quantities, the bra and the ket, whereas in vector algebra we
have only one. This is because there is not an exact analogy to commutation for Dirac brackets:
$\langle\chi|\phi\rangle = \langle\phi|\chi\rangle^*$ includes taking a complex conjugate. Consider manipulating the bras and kets.
We can write a vector in terms of its components thus

$$\mathbf{A} = \sum_i \mathbf{e}_i(\mathbf{e}_i\cdot\mathbf{A})$$

where $(\mathbf{e}_i\cdot\mathbf{A})$ is the amount of $\mathbf{A}$ along the $\mathbf{e}_i$ axis; the components. The quantities on either side
of the equation are not numbers but vectors. We can generate a whole algebra based on vectors.

Likewise we can write a state thus:

$$|\phi\rangle = \sum_i |i\rangle\langle i|\phi\rangle$$

where $\langle i|\phi\rangle$ is the amount of $\phi$ along the $i$ basis state; the components or expansion coefficients.
The quantities on each side of this equation are not numbers but functions. $|\phi\rangle$ is a normalised
wavefunction iff $\sum_i |\langle i|\phi\rangle|^2 = 1$. We can then generate a whole algebra based on bras and kets.

For any different complete sets of basis states $i$ and $j$, we can write: $|\phi\rangle = \sum_j |j\rangle\langle j|\phi\rangle$ and
$|\phi\rangle = \sum_i |i\rangle\langle i|\phi\rangle$. Expansions in $i$ and $j$ are called different representations of $\phi$. This is very
similar to using different coordinate systems: the bases $i$ and $j$ are analogous to two sets of
axes rotated with respect to one another. We might choose a complete set of wavefunctions as a
representation which includes $\phi$, just as we sometimes choose axes such that some special vector
points along the z-axis.

Going even further, the expansion in a basis can be done for any $|\phi\rangle$, so we can dispense with $|\phi\rangle$
and write:

$$I = \sum_i |i\rangle\langle i|, \quad \text{the unit operator}$$

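All this means is that in any equation you can always proceed by breaking the states down into a
complete, orthonormal set of basis functions. This may be useful when dealing with a Hamiltonian $H_0$
for which the eigenstates $i$ with eigenenergies $E_i$ are already known. A general mixed state $|\phi\rangle$
(here meaning a superposition of eigenstates) has energy:

$$\langle\phi|H_0|\phi\rangle = \sum_i\sum_j \langle\phi|i\rangle\langle i|H_0|j\rangle\langle j|\phi\rangle = \sum_i |\langle i|\phi\rangle|^2 E_i \quad \text{since for } i \neq j,\ \langle i|H_0|j\rangle = 0$$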

So we could use the solution to an easier problem (the eigenvalue problem, which we need solve 
only once per Hamiltonian) so that we never need to apply the complicated Hamiltonian to the 
complicated mixed state! This is a very useful trick - reformulating a problem so that we can make 
use of some work that has already been done. In this case the single, hard, problem of finding 
the energy of a mixed state is changed to the many, easier, problems of finding the energy of the 
eigenstates and the amount of each eigenstate in the mixed state. 
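The two routes to the energy can be compared numerically. This is a minimal sketch under stated assumptions: a random 5x5 Hermitian matrix stands in for $H_0$, a random normalised vector stands in for $|\phi\rangle$, and all variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# A random 5x5 Hermitian matrix standing in for H_0 (hypothetical example)
M = rng.normal(size=(5, 5)) + 1j * rng.normal(size=(5, 5))
H0 = (M + M.conj().T) / 2

# Solve the eigenvalue problem once: columns of `basis` are the eigenstates |i>
E, basis = np.linalg.eigh(H0)

# A random normalised state |phi>
phi = rng.normal(size=5) + 1j * rng.normal(size=5)
phi /= np.linalg.norm(phi)

# Direct route: apply H_0 to |phi> and take the overlap with <phi|
energy_direct = np.vdot(phi, H0 @ phi).real

# Expansion route: coefficients <i|phi>, then sum |<i|phi>|^2 E_i
coeffs = basis.conj().T @ phi
energy_expansion = np.sum(np.abs(coeffs) ** 2 * E)

print(energy_direct, energy_expansion)
```

Once `E` and `basis` are known, the energy of any further state needs only its expansion coefficients, which is the saving described in the text.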

1.8 Using Bras to pick Kets 

One of the most useful algebraic tricks in quantum mechanics is to multiply a sum of terms by
a complex conjugate wavefunction, and integrate the product over all space. Orthogonality often
means that this procedure can be used to 'pick' a single term from the sum. In Dirac notation
this procedure simply becomes applying $\langle i_m|$.

For example, if we have an expansion of a mixed state $\Phi$ in eigenstates $i_n$: $H|\Phi\rangle = \sum_n H|i_n\rangle\langle i_n|\Phi\rangle$,
we can remove the sum by applying $\langle i_m|$:

$$\langle i_m|H|\Phi\rangle = \langle i_m|\sum_n H|i_n\rangle\langle i_n|\Phi\rangle = E_m\langle i_m|\Phi\rangle$$

This works because $\langle i_m|H|i_n\rangle = E_m\delta_{nm}$; it is analogous to taking components of a vector.

1.9 Good quantum numbers 

It is normal to think of the eigenvalues as labelling states. In that case they are just called
quantum numbers. A set of eigenvalues from a complete commuting set of operators are called
good quantum numbers. The eigenvalues from a non-commuting operator are bad quantum
numbers, because their values cannot be known simultaneously.

This is not quite as simple as it seems. In real systems the Hamiltonian may contain many
small terms (perturbations) which may not commute with the operators which commute with the
unperturbed Hamiltonian. Although in principle the quantum numbers are no longer good, in
practice they are often used.

An example of this is in spin-orbit coupling of angular momenta in many-electron atoms. Here
$L_z = \sum_i l_{iz}$ is a good quantum number in the absence of spin-orbit coupling, but $L_z$ does not
commute with the spin-orbit coupling operator $\sum_i \mathbf{l}_i\cdot\mathbf{s}_i$. Thus for light atoms, where spin-orbit
coupling is weak, $L_z$ is often used although it is not strictly a good quantum number.



2 Review: Time-Independent Non-degenerate Perturbation Theory 

There's nothing new in this section, it's simply an alternative derivation to the one you saw last
year in Junior Honours. If you preferred that derivation, feel free to read over those notes, the
results are the same!

2.1 Small changes to the Hamiltonian 

There are very few problems in quantum mechanics which can be solved exactly. However, we
are often interested in the effect of a small change to a system, and in such cases we can proceed
by assuming that this causes only a small change in the eigenstates. Perturbation theory provides
a method for finding approximate energy eigenvalues and eigenfunctions for a system whose
Hamiltonian is of the form

$$H = H_0 + V$$

where $H_0$ is the 'main bit' of the Hamiltonian of an exactly solvable system, for which we know
the eigenvalues, $E_n$, and eigenfunctions, $|n\rangle$, and $V$ is a small, time-independent perturbation. $H$,
$H_0$ and $V$ are Hermitian operators. Using perturbation theory, we can get approximate solutions
for $H$ using as basis functions eigenstates of the similar, exactly solvable system $H_0$.

Assuming that $H$ and $H_0$ possess discrete, non-degenerate eigenvalues only, we write

$$H_0|n_i\rangle = E_i|n_i\rangle$$

in Dirac notation. The states $|n_i\rangle$ are orthonormal. WLOG, consider a state $i = 0$: the effect of
the perturbation will be to modify the state and its corresponding energy slightly. The eigenstate
$|n_0\rangle$ will become $|\phi_0\rangle$ and $E_0$ will shift to $E_0 + \Delta E_0$, where

$$H|\phi_0\rangle = (E_0 + \Delta E_0)|\phi_0\rangle$$

WLOG, expanding $|\phi_0\rangle$ in the basis set $|n_i\rangle$ with coefficients $c_{i0}$ and premultiplying by $\langle n_0|$:

$$\langle n_0|(H_0 + V)\sum_{i=0}^{\infty} c_{i0}|n_i\rangle = (E_0 + \Delta E_0)\langle n_0|\sum_{i=0}^{\infty} c_{i0}|n_i\rangle$$

Which after a little algebra and cancellation yields the exact result:

$$\Delta E_0 = \langle n_0|V|n_0\rangle + \sum_{i=1}^{\infty}(c_{i0}/c_{00})\langle n_0|V|n_i\rangle \qquad (1)$$

Similarly, expanding $|\phi_0\rangle$ in the basis set $|n_i\rangle$ and premultiplying by another state $\langle n_k|$:

$$\langle n_k|(H_0 + V)\sum_{i=0}^{\infty} c_{i0}|n_i\rangle = (E_0 + \Delta E_0)\langle n_k|\sum_{i=0}^{\infty} c_{i0}|n_i\rangle$$

leading to $|\phi_0\rangle$ having a component of $|n_k\rangle$

$$c_{k0}(E_0 + \Delta E_0 - E_k) = \sum_{i=0}^{\infty} c_{i0}\langle n_k|V|n_i\rangle \qquad (2)$$

Note that although we have denoted the unperturbed state as $|n_0\rangle$, it is not necessarily the ground
state.



2.2 First order energy shifts 



In first order perturbation theory, we assume that the change in the wavefunction is small, i.e.
$|c_{i0}/c_{00}| \ll 1$ for all $i \neq 0$, and neglect the second term in equation 1, which becomes

$$\Delta E_0 \approx \langle n_0|V|n_0\rangle = V_{00}$$

which is one of the most useful results in quantum mechanics. It tells us how to calculate the
change in the nth energy eigenvalue, to first order:

The shift in energy induced by a perturbation is given to first order by
the expectation value of the perturbation with respect to the unperturbed
state.

Thus first order time independent perturbation is equivalent to making the approximation that the
wavefunction does not change. Loosely, this works because the energy depends on the perturbation
to first order, but on the wavefunction squared.

2.3 Mixing of the eigenstates of H 

Turning to equation 2, we make the approximation $c_{i0} \ll c_{00} \approx 1$ for all $i \neq 0$, so that the only
significant term in the sum comes from $i = 0$, and also that $\Delta E_0$ is negligible compared to the
energy difference between states $0$ and $k$:

$$c_{k0} \approx \frac{\langle n_k|V|n_0\rangle}{E_0 - E_k} \qquad (3)$$

Using these coefficients, we see that the perturbation causes a first-order correction to the energy
eigenvector $|n_0\rangle$:

$$|\phi_0\rangle = |n_0\rangle + \sum_{k\neq 0}\frac{\langle n_k|V|n_0\rangle}{E_0 - E_k}|n_k\rangle = |n_0\rangle + \sum_{k\neq 0}\frac{V_{k0}}{E_0 - E_k}|n_k\rangle$$

Which defines the matrix element $V_{ij}$ for $i = k$, $j = 0$. We speak of the perturbation mixing the
unperturbed eigenfunctions since the effect is to add to the unperturbed eigenfunction, $|n_0\rangle$, a
small amount of each of the other unperturbed eigenfunctions. The denominator suggests that
states with similar energies are more strongly mixed, and the "matrix element" determines how
the perturbation mixes the states.

Unlike the formula for the energy shift, we are faced in general with evaluating an infinite sum to 
find the correction to the eigenfunctions. 



2.4 Higher Orders 

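It may turn out that the matrix element $V_{00}$ is zero, often due to symmetry. In this case we must
consider what happens at second order. Going back to equation 1, and using our expression for
mixing (equation 3) and the assumption $c_{00} \approx 1$, we obtain

$$\Delta E_0 \approx V_{00} + \sum_{i\neq 0}\frac{V_{0i}V_{i0}}{E_0 - E_i} = V_{00} + \sum_{i\neq 0}\frac{|V_{i0}|^2}{E_0 - E_i}$$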



2.5 Notes



• The results in 2.2, 2.3 and 2.4 are worth memorising: physicists use them without proof.

• Energy shifts are real numbers, but matrix elements may be complex.

• If the perturbation operator commutes with the Hamiltonian, "off-diagonal" matrix elements
($V_{ij}$, $i \neq j$) are zero. Such perturbations change the energy, but not the wavefunction.

• If the perturbation is turned on and off again, the off-diagonal matrix elements determine
whether the quantum state is changed.

• To help with notation, we have derived results for perturbation to a state labelled by 0. This
is not necessarily the ground state - the above derivation is general.

• For the first-order changes to the eigenfunction to be small we must have:

$$|\langle n_k|V|n_0\rangle| = |V_{k0}| \ll |E_0 - E_k| \quad \text{for all } k \neq 0$$

• Similarly, we require that the level shift be small compared to the level spacing in the
unperturbed system:

$$|\Delta E_0| \ll \min_k |E_0 - E_k|$$

• These conditions may break down if there are degeneracies in the unperturbed system. However,
we need only assume that the particular energy level whose shift we are calculating is
non-degenerate for the preceding analysis to be correct.

• The first order corrected wavefunctions are not fully normalised.

• The second order term always lowers the energy of the ground state.



2.6 Example 

Consider a simple harmonic oscillator in its ground state, to which we apply a perturbation
$V = \lambda x^2$. We know the unperturbed wavefunction $|n_0\rangle = [m\omega_0/\pi\hbar]^{1/4}\exp\{-m\omega_0 x^2/2\hbar\}$, so we can
evaluate the first order shift in energy according to the perturbation theory:

$$\Delta E_0 = \langle n_0|\lambda x^2|n_0\rangle = \lambda\sqrt{m\omega_0/\pi\hbar}\int_{-\infty}^{\infty} x^2\exp\{-m\omega_0 x^2/\hbar\}\,dx = \frac{\lambda\hbar}{2m\omega_0}$$

In this case we know the exact shift, since the perturbation is simply an additional harmonic
potential, giving a total $k = m\omega_0^2 + 2\lambda$ and an exact ground state energy of $\frac{1}{2}\hbar\sqrt{\omega_0^2 + 2\lambda/m}$. It is
easy to verify that to first order in $\lambda$ these expressions are identical:
$\frac{1}{2}\hbar\omega_0\sqrt{1 + 2\lambda/m\omega_0^2} \approx \frac{1}{2}\hbar\omega_0 + \lambda\hbar/2m\omega_0$.

To determine the amount of mixing of states, we need to evaluate matrix elements like $\langle n_0|\lambda x^2|n_i\rangle$.
We won't evaluate these here, but we will note that for odd $i$ the integral is zero - the symmetric
perturbation only mixes in symmetric excited states.
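The comparison between first-order and exact shifts can be checked numerically. The following is a rough sketch, assuming units with $\hbar = m = \omega_0 = 1$, a number-state basis truncated at `N` states, and an illustrative value of $\lambda$; the variable names are hypothetical. It uses the standard ladder-operator form $x = (a + a^\dagger)/\sqrt{2}$ rather than the integrals in the text.

```python
import numpy as np

N = 40       # size of the truncated number basis (assumption)
lam = 0.05   # perturbation strength lambda (illustrative value)

n = np.arange(N)
a = np.diag(np.sqrt(n[1:]), k=1)   # annihilation operator: a|n> = sqrt(n)|n-1>
x = (a + a.T) / np.sqrt(2)         # position operator in units hbar = m = omega_0 = 1
E0 = n + 0.5                       # unperturbed energies (n + 1/2)
V = lam * (x @ x)                  # matrix of the perturbation lambda x^2

# First-order shift: expectation value of V in the unperturbed ground state
first_order = V[0, 0]

# Exact shift from the modified oscillator frequency sqrt(1 + 2 lambda)
exact = 0.5 * np.sqrt(1 + 2 * lam) - 0.5

# First-order mixing coefficients c_k0 = V_k0 / (E_0 - E_k) for k != 0
mixing = np.zeros(N)
mixing[1:] = V[1:, 0] / (E0[0] - E0[1:])

# Direct diagonalisation of the truncated H_0 + V as a cross-check
numerical = np.linalg.eigvalsh(np.diag(E0) + V)[0] - 0.5

print("first order:", first_order)
print("exact:      ", exact)
print("numerical:  ", numerical)
print("c_10, c_20: ", mixing[1], mixing[2])
```

In this sketch the odd state $k = 1$ is not mixed in at all, and only $k = 2$ enters at first order, in line with the symmetry argument above.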



3 Dealing with Degeneracy 



3.1 Time-Independent Degenerate Perturbation Theory 



We have seen how we can find approximate solutions for a system whose Hamiltonian is of the
form

$$H = H_0 + V$$

when we assumed that $H$ and $H_0$ possess discrete, non-degenerate eigenvalues only. This led to
a mixing of states where

$$|\phi_0\rangle = |n_0\rangle + \sum_{k\neq 0}\frac{V_{k0}}{E_0 - E_k}|n_k\rangle$$

Clearly, if $E_0 = E_k$, this diverges, as do the higher order energy shifts (see 2.4). Thus for
the degenerate case we cannot associate a particular perturbed state $|\phi_0\rangle$ with a particular
unperturbed state $|n_0\rangle$: we need to take a different approach. In fact, the approximation we make
is completely different: we assume that the small perturbation only mixes those states which are
degenerate. We then solve the problem exactly for that subset of states.

Assume that $H_0$ possesses $N$ degenerate eigenstates $|m\rangle$ with eigenvalue $E_{deg}$. It may also possess
non-degenerate eigenstates, which can be treated separately by non-degenerate perturbation
theory. We write a perturbed eigenstate $|\phi_j\rangle$ as a linear expansion in the unperturbed degenerate
eigenstates only:

$$|\phi_j\rangle = \sum_i |m_i\rangle\langle m_i|\phi_j\rangle = \sum_i c_{ji}|m_i\rangle$$

where $i$ here runs over degenerate states only. The TISE now becomes:

$$[H_0 + V]|\phi_j\rangle = [H_0 + V]\sum_i c_{ji}|m_i\rangle = E_j\sum_i c_{ji}|m_i\rangle$$

but we know that for all degenerate eigenstates $H_0|m_i\rangle = E_{deg}|m_i\rangle$. So we obtain:

$$\sum_i c_{ji}V|m_i\rangle = (E_j - E_{deg})\sum_i c_{ji}|m_i\rangle$$

premultiplying by some unperturbed state $\langle m_k|$ gives

$$\sum_i c_{ji}\left[\langle m_k|V|m_i\rangle - \delta_{ik}(E_j - E_{deg})\right] = 0$$

We can get a similar equation from each unperturbed state $|m_k\rangle$. We thus have an eigenvalue
problem: the eigenvector has elements $c_{ji}$ and the eigenvalues are $\Delta E_j = E_j - E_{deg}$. Writing the
matrix elements between the $i$th and $k$th unperturbed degenerate states as $V_{ik} = \langle m_i|V|m_k\rangle$ we
recover the determinantal equation:

$$\begin{vmatrix} V_{11} - \Delta E_j & V_{12} & \cdots & V_{1N} \\ V_{21} & V_{22} - \Delta E_j & \cdots & V_{2N} \\ \vdots & \vdots & \ddots & \vdots \\ V_{N1} & V_{N2} & \cdots & V_{NN} - \Delta E_j \end{vmatrix} = 0$$

The $N$ eigenvalues obtained by solving this equation give the shifts in energy due to the perturbation,
and the eigenvectors give the perturbed states $|\phi\rangle$ in the unperturbed, degenerate basis
set $|m\rangle$.



3.2 Notes



• The perturbed eigenstates of $H$ are linear combinations of degenerate eigenstates of $H_0$.
This means that they too are eigenstates of $H_0$, from a different eigenbasis.

• If $H_0$ is compatible with $V$, i.e. $[H_0, V] = 0$, then there is no mixing with non-degenerate
states and the analysis above is exact.

• Notice how the mathematics mimics the quantum mechanics. Without the perturbation
the eigenbasis of $H_0$ is not unique. When we try to determine its energy shift we find a
matrix equation which can only be solved for specific values of $\Delta E_j$. These $\Delta E_j$ in turn
correspond to specific choices for the coefficients $c_{ji}$, i.e. particular linear combinations of the
unperturbed states. Thus to solve the equations we are forced to collapse the wavefunction
onto an eigenstate of $V$. $V_{ki}$ is a Hermitian matrix, and consequently has real eigenvalues.



3.3 Example of degenerate perturbation theory: Stark Effect in Hydrogen 

The change in energy levels in an atom due to an external electric field is known as the Stark
effect. The perturbing potential is thus $V = eEz = eEr\cos\theta$. Ignoring spin, we examine this
effect on the fourfold degenerate $n=2$ levels. We will label these by their appropriate quantum
numbers: $|l,m\rangle$.

$$u_{00} = (8\pi a_0^3)^{-1/2}(1 - r/2a_0)e^{-r/2a_0}; \quad u_{10} = (8\pi a_0^3)^{-1/2}(r/2a_0)\cos\theta\,e^{-r/2a_0}$$
$$u_{11} = (\pi a_0^3)^{-1/2}(r/8a_0)\sin\theta\,e^{i\phi}e^{-r/2a_0}; \quad u_{1-1} = (\pi a_0^3)^{-1/2}(r/8a_0)\sin\theta\,e^{-i\phi}e^{-r/2a_0}$$

From the analysis above, we need to calculate the matrix elements

$$V_{lm,l'm'} = \langle l,m|eEz|l',m'\rangle = eE\iiint u^*_{lm}(r\cos\theta)\,u_{l'm'}\,r^2\sin\theta\,d\theta\,d\phi\,dr$$

It turns out that many of these are zero, since if any of the integrals are zero their product will
be. Looking first at parity, it is clear that $eEz$ has odd parity ($eEr\cos(\pi - \theta) = -eEr\cos\theta$),
$u_{00}$ has even parity and $u_{1m}$ have odd parity. Since the integral over all space of any odd function
is zero, $V_{00,00} = V_{1m,1m'} = 0$. Secondly, $\int_0^{2\pi} e^{\pm i\phi}\,d\phi = 0$, so $V_{00,11} = V_{00,1-1} = V_{11,00} = V_{1-1,00} = 0$.

Since the perturbation is real, $V_{00,10} = V_{10,00}$ and the only remaining non-zero matrix element is:

$$\langle 00|eEr\cos\theta|10\rangle = eE(8\pi a_0^3)^{-1}\int_0^{2\pi}d\phi\int_0^{\pi}\cos^2\theta\sin\theta\,d\theta\int_0^{\infty}(1 - r/2a_0)\,e^{-r/a_0}\,\frac{r^4}{2a_0}\,dr = -3eEa_0$$

This is best solved as a matrix problem; the determinantal equation is then:

$$\begin{vmatrix} -\Delta E & -3eEa_0 & 0 & 0 \\ -3eEa_0 & -\Delta E & 0 & 0 \\ 0 & 0 & -\Delta E & 0 \\ 0 & 0 & 0 & -\Delta E \end{vmatrix} = (\Delta E)^4 - (\Delta E)^2(3eEa_0)^2 = 0$$



The solutions to this are $\Delta E = \pm 3eEa_0, 0, 0$. The degeneracy of the states $u_{11}$ and $u_{1-1}$ is not
lifted, but the new non-degenerate eigenstates corresponding to $\Delta E = \pm 3eEa_0$ are mixtures,
$(u_{00} \mp u_{10})/\sqrt{2}$. Consequently, the spectral line corresponding to the $n = 2 \to n = 1$ Lyman-$\alpha$
transition is split into three if the hydrogen atom is in an electric field.
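The same eigenvalue problem can be solved numerically. The following is a minimal sketch, assuming energies are measured in units of $eEa_0$ and the basis is ordered as $|0,0\rangle$, $|1,0\rangle$, $|1,1\rangle$, $|1,-1\rangle$; the ordering and variable names are illustrative choices.

```python
import numpy as np

eEa0 = 1.0  # the product e*E*a_0, used as the unit of energy (assumption)

# Perturbation matrix in the degenerate n=2 subspace.
# Assumed basis order: |0,0>, |1,0>, |1,1>, |1,-1>
V = np.zeros((4, 4))
V[0, 1] = V[1, 0] = -3.0 * eEa0   # the only non-zero matrix elements

# Eigenvalues are the energy shifts; columns of `states` are the new eigenstates
shifts, states = np.linalg.eigh(V)

print("energy shifts in units of e E a_0:", shifts)
print("state with the largest shift:", states[:, -1])
```

The two states with zero shift lie entirely within the $|1,\pm 1\rangle$ subspace, while the shifted states are equal-weight combinations of $|0,0\rangle$ and $|1,0\rangle$.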

A curious aspect of these eigenstates is that they are not eigenstates of $L^2$, although they are
eigenstates of $L_z$. Nor do they have definite parity. In an electric field, therefore, the total angular
momentum is not a good quantum number. Note that this effect is specific to hydrogen, since in
other elements the s and p levels are not degenerate.

Experimental results confirm this theory beautifully - the splitting of levels in hydrogen varies 
linearly with the applied field strength, while in all other atoms it varies quadratically: the first 
order perturbation is zero. 

Looking at the electrostatics: the energy of a spherically symmetric charge density in a uniform 
field is clearly independent of orientation. To have any orientation dependence the object must 
have a dipole moment. The combination of 2s and 2p wavefunctions achieves this. 



3.4 Symmetry and Degeneracy 

In real systems degeneracy is almost always related to symmetry. In general if the probability density
has lower symmetry than the Hamiltonian, the wavefunction will be degenerate.

There is a clear physical reason behind this. Consider the $2p_x$ orbital in hydrogen: it has a lobe
along the x-axis. However, there is no measurable quantity which defines an x-axis - the coordinate
system is just introduced by physicists to help solve the equations. The lobe could just as well
point in the y or z or (27, 43.2, -12) direction. Thus the $p_x$ orbital has lower symmetry than the
Hamiltonian (spherically symmetric potential), and is degenerate with $p_y$ and $p_z$. Likewise the
spin: we talk about 'spin up', but there is no way to define 'up' from the Hamiltonian. Thus there
is degeneracy between spin states 'up' and 'down'.

If we reduce the symmetry of the Hamiltonian, we now 'lift' the degeneracy (i.e. the levels no
longer have the same energy). For example, an applied magnetic field defines an axis and lowers
the symmetry of the Hamiltonian. If the field is weak, we can use perturbation theory and assume
we still have p orbitals (Zeeman effect). Now, the orbitals must be eigenstates not only of $H_0$,
but also of $\boldsymbol{\mu}\cdot\mathbf{B}$ where $\boldsymbol{\mu}$ is the magnetic dipole moment. The degenerate energy level splits into
several different energy levels, depending on the relative orientation of the moment and the field:
the degeneracy is lifted by the reduction in symmetry.



3.5 Time-variation of expectation values: Degeneracy and constants of motion 

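The time variation of the expectation value of an operator $A$ (with no explicit time dependence) is:

$$\frac{d}{dt}\langle\Phi|A|\Phi\rangle = \int\left(\frac{\partial\Phi^*}{\partial t}A\Phi + \Phi^*A\frac{\partial\Phi}{\partial t}\right)d^3r$$

but since $i\hbar\,\partial\Phi/\partial t = H\Phi$ and $-i\hbar\,\partial\Phi^*/\partial t = H^*\Phi^*$,

$$-i\hbar\frac{d}{dt}\langle\Phi|A|\Phi\rangle = \int\left(H^*\Phi^*A\Phi - \Phi^*AH\Phi\right)d^3r = \langle\Phi|[H,A]|\Phi\rangle$$

where we also use the fact that $H$ is Hermitian. Thus if $H$ commutes with $A$ ($[H,A] = 0$), the
expectation value of $A$ is independent of time. It is a conserved quantity.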
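The conservation result can be illustrated numerically. This is a sketch under stated assumptions: $\hbar = 1$, a random 4x4 Hermitian matrix stands in for $H$, `A` is built from the eigenvectors of `H` so that it commutes with it, and `B` is a generic Hermitian matrix that does not. All names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)


def random_hermitian(dim):
    """Return a random dim x dim Hermitian matrix."""
    m = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    return (m + m.conj().T) / 2


dim = 4
H = random_hermitian(dim)
E, U = np.linalg.eigh(H)          # H = U diag(E) U^dagger

# A shares the eigenvectors of H, so [H, A] = 0; B is a generic observable
A = U @ np.diag([1.0, 2.0, 3.0, 4.0]) @ U.conj().T
B = random_hermitian(dim)

phi0 = rng.normal(size=dim) + 1j * rng.normal(size=dim)
phi0 /= np.linalg.norm(phi0)


def evolve(t):
    """Solve the time-dependent Schroedinger equation (hbar = 1) for phi(t)."""
    return U @ (np.exp(-1j * E * t) * (U.conj().T @ phi0))


def expectation(op, state):
    return np.vdot(state, op @ state).real


times = np.linspace(0.0, 5.0, 11)
A_values = np.array([expectation(A, evolve(t)) for t in times])
B_values = np.array([expectation(B, evolve(t)) for t in times])

print("spread of <A> over time:", A_values.max() - A_values.min())
print("spread of <B> over time:", B_values.max() - B_values.min())
```

In this sketch the expectation value of `A` stays fixed to numerical precision, while that of `B` generally changes with time.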

As we have seen above, if we have degenerate eigenstates of the Hamiltonian, $H$, then there must
be some other operator $A$ which commutes with the Hamiltonian for which there are
energy-degenerate eigenstates with different eigenvalues $A$. These eigenvalues, $A$, are then constants of
the motion. Moreover, if $\Phi$ is an eigenfunction of $H$, then $A\Phi$ is also an eigenfunction of $H$:

$$H(A\Phi) = AH\Phi = A(E\Phi) = E(A\Phi)$$

There is a three way link between symmetry, degeneracy and conserved quantities.




Figure 1: Any linear combination of two degenerate eigenstates produces another eigenstate.

3.6 Wavefunction Collapse onto degenerate levels

Refer back to the postulates of quantum mechanics: We know that acting with an operator $A$ on
an eigenstate $|\alpha_n\rangle$ of that operator gives us an eigenvalue $A_n$, which corresponds to a measurable
quantity.

There is no guarantee that $|\alpha_n\rangle$ is the only eigenstate of $A$ which has this eigenvalue (e.g. energy
levels in hydrogen). Different states with the same eigenvalue are referred to as degenerate.

Assume we find two orthogonal, degenerate eigenstates of $A$: $|\alpha_1\rangle$ and $|\alpha_2\rangle$, i.e. $A|\alpha_1\rangle = A_1|\alpha_1\rangle$
and $A|\alpha_2\rangle = A_1|\alpha_2\rangle$. We also see that

$$A(\cos\theta|\alpha_1\rangle + \sin\theta|\alpha_2\rangle) = A_1(\cos\theta|\alpha_1\rangle + \sin\theta|\alpha_2\rangle)$$

for any $\theta$. We use $\cos\theta$ for the expansion instead of the normal $c_i$ to emphasise the similarity
between eigenstates and vectors. It also allows for easy normalisation since $\cos^2\theta + \sin^2\theta = 1$.

Thus any linear combination of degenerate eigenstates produces another eigenstate. There is still
only twofold degeneracy, because there are only two orthogonal states, $(\sin\theta|\alpha_1\rangle - \cos\theta|\alpha_2\rangle)$ being
the other one. The complete set of orthonormal eigenstates for $A$ is thus not a unique quantity,
since we can choose any $\theta$ to generate a pair of degenerate eigenstates.

A consequence of this is that when a measurement is made of A which finds A 1 , there is not a 
complete collapse of the wavefunction. 

Consider measuring observable $A$ in a system in a general state $|\Phi\rangle$. By expanding $|\Phi\rangle$ in the
eigenstates of $A$: $|\Phi\rangle = \sum_i c_i|\alpha_i\rangle$ we find the probability that the measurement will yield result
$A_1$ is

$$|\langle\alpha_1|\Phi\rangle|^2 + |\langle\alpha_2|\Phi\rangle|^2 = |c_1|^2 + |c_2|^2$$

The measurement has determined that we are either in state $\alpha_1$ or $\alpha_2$, but not which. Thus there
is a partial collapse of the wavefunction onto a linear combination of them:

$$(\cos\theta|\alpha_1\rangle + \sin\theta|\alpha_2\rangle); \quad \cos\theta = \frac{c_1}{\sqrt{|c_1|^2 + |c_2|^2}}$$

which is itself an eigenvector of $A$.

Thus, in the case of degenerate final states, the final wavefunction after the measurement does 
depend on the initial wavefunction. The generalisation of this to the case of many degenerate 
states is straightforward. 
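The partial collapse can be sketched as a projection onto the degenerate subspace followed by renormalisation. The example below assumes a three-state system in which $|\alpha_1\rangle$ and $|\alpha_2\rangle$ share the eigenvalue $A_1 = 1$; the matrix `A`, the coefficients `c` and the projector `P` are illustrative values, not taken from the notes.

```python
import numpy as np

# Observable A in its own eigenbasis: |alpha_1> and |alpha_2> share eigenvalue 1.0
A = np.diag([1.0, 1.0, 2.0])

# Expansion coefficients c_i of the initial state (hypothetical, normalised)
c = np.array([0.6, 0.48j, 0.64])

# Projector onto the degenerate subspace spanned by |alpha_1> and |alpha_2>
P = np.diag([1.0, 1.0, 0.0])

# Probability of measuring A_1: |c_1|^2 + |c_2|^2
prob_A1 = np.vdot(c, P @ c).real

# State after the measurement: projected and renormalised
collapsed = (P @ c) / np.sqrt(prob_A1)

print("probability of A_1:", prob_A1)
print("state after measurement:", collapsed)
```

The ratio of the two surviving coefficients is unchanged by the projection, which is the sense in which the final state still depends on the initial one.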






4 Degeneracy, Symmetry and Conservation Laws 



4.1 Distinguishing between eigenstates, Quantum numbers as labels 

How can we distinguish between quantum states $|\alpha_n\rangle$ which have degenerate values of $A$? The
obvious way is to measure the quantised observables and use them to label the state. We must be
sure not to make measurements which change the state. Thus all measurements should correspond
to commuting operators (Compatible observations: see QP3). In the non-degenerate case measuring
energy is sufficient, but in hydrogen, for example, we used quantum numbers $n$ (for energy,
operator $H$), $l$ (for total angular momentum, $L^2$) and $m_l$ (one component of angular momentum,
$L_z$).

Continuing the example of twofold degeneracy (3.6), suppose that some operator $B$ is compatible
with $A$. This means that $[A, B] = 0$ and $A$ and $B$ have a common eigenbasis, i.e. some $\theta$ and
$\theta + \pi/2$ give eigenstates of both $A$ and $B$ in the form $|\alpha(\theta)\rangle = (\cos\theta|\alpha_1\rangle + \sin\theta|\alpha_2\rangle)$.

To find the appropriate value of $\theta$, we have a similar problem to that encountered in 3.1 and must
solve for the eigenvectors of:

$$\begin{pmatrix} \langle\alpha_1|B|\alpha_1\rangle & \langle\alpha_1|B|\alpha_2\rangle \\ \langle\alpha_2|B|\alpha_1\rangle & \langle\alpha_2|B|\alpha_2\rangle \end{pmatrix}$$

The eigenvalues of this matrix are the quantised measurable values of $B$. If both of these are
equal, there must be another measurable $C$ which will distinguish the two states.
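As an illustration, the mixing angle can be read off from the eigenvectors of the 2x2 matrix of $B$. The sketch below assumes a hypothetical real symmetric matrix `B_sub` for the elements $\langle\alpha_i|B|\alpha_j\rangle$; its entries are made up for the example.

```python
import numpy as np

# Hypothetical matrix elements <alpha_i|B|alpha_j> in the degenerate subspace of A
B_sub = np.array([[1.0, 0.5],
                  [0.5, 1.0]])

# Eigenvalues are the measurable values of B; columns are (cos theta, sin theta)
b_values, vectors = np.linalg.eigh(B_sub)

# Angle theta for each eigenvector, from its two components
thetas = np.arctan2(vectors[1], vectors[0])

for b, theta in zip(b_values, thetas):
    print(f"B eigenvalue {b:.2f}: theta = {theta:.4f} rad")
```

Because the two eigenvalues of `B_sub` differ, measuring $B$ as well as $A$ is enough to label the two states uniquely in this example.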

The generalisation to many degenerate levels is straightforward. If there are n orthogonal degener- 
ate eigenstates of A, (therefore an n-dimensional space in which every unit vector is an eigenstate 
of A), compatibility of eigenbases means there are at least n eigenstates of B. It is now possi- 
ble that all these have different B eigenvalues, or that at least two have the same eigenvalue, in 
which case if we want a specific set of orthogonal eigenstates, we must look for another compatible 
operator C. 

When the set of operators is sufficiently large that there is a unique set of eigenvalues for each
eigenstate, we call it a complete commuting set of operators. An example is $H$, $L^2$, $S_z$ and $L_z$ in
hydrogen. The complete commuting set is not unique for a given Hamiltonian: for hydrogen we
could have used $H$, $L^2$, $S_x$ and $L_x$, or $H$, $L^2$, $J^2$ and $J_z$. If one of the quantum numbers can be
written in terms of the others then it is redundant. If two of the quantum numbers come from
non-commuting operators, then the set does not define a state, since the full set of measurements
could not be performed without changing the wavefunction.

4.2 Example 

Consider the 2D harmonic oscillator $V_0 = \frac{1}{2}m\omega^2(x^2 + y^2)$. If we measure the energy and find it to
be $2\hbar\omega$, then the state could be $|n_x = 1, n_y = 0\rangle$ or $|n_x = 0, n_y = 1\rangle$ or any linear combination.
To fully define any state we require any two of the quantum numbers $n_x$, $n_y$ and $E = (n_x + n_y + 1)\hbar\omega$.

Suppose we measure the energy and find $3\hbar\omega$: there is a partial collapse of the wavefunction and
there are three degenerate possibilities. Suppose we then apply a perturbation $\Delta V = \lambda x^2$ (see
2.6). This breaks the symmetry and collapses the wavefunction onto either $|1,1\rangle$, $|2,0\rangle$ or $|0,2\rangle$.
The perturbation matrix (see 3.1) $\langle n_x, n_y|\Delta V|n_x', n_y'\rangle$ is diagonal provided we choose the basis
with $x$ along the direction of the perturbation, and it has eigenvalues $(n_x + \frac{1}{2})\lambda\hbar/m\omega$. If we then
measure the energy and find $E = 3\hbar\omega + \lambda\hbar/2m\omega$ then we know that the state is $|0,2\rangle$: a complete
collapse onto a single wavefunction.
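The perturbation matrix in the degenerate subspace can be written out explicitly. The following is a minimal sketch, assuming units with $\hbar = m = \omega = 1$ and the standard ladder-operator result for $\langle n|x^2|n'\rangle$; the function names and the value of $\lambda$ are illustrative choices, not taken from the notes.

```python
import math

def x2_element(n, n_prime, hbar=1.0, m=1.0, omega=1.0):
    """<n| x^2 |n'> for a 1D oscillator, using x = sqrt(hbar / 2 m omega) (a + a^dagger)."""
    pref = hbar / (2.0 * m * omega)
    if n == n_prime:
        return pref * (2 * n_prime + 1)
    if n == n_prime - 2:
        return pref * math.sqrt(n_prime * (n_prime - 1))
    if n == n_prime + 2:
        return pref * math.sqrt((n_prime + 1) * (n_prime + 2))
    return 0.0  # x^2 only connects states whose n differ by 0 or 2

def perturbation_matrix(states, lam):
    """Matrix of lam * x^2 between 2D oscillator states (n_x, n_y); n_y must be unchanged."""
    return [[lam * x2_element(bra[0], ket[0]) if bra[1] == ket[1] else 0.0
             for ket in states] for bra in states]

# The three degenerate states with E = 3 hbar omega.
states = [(1, 1), (2, 0), (0, 2)]
matrix = perturbation_matrix(states, lam=0.1)
```

With these assumptions the matrix comes out diagonal, with entries $(n_x + \frac{1}{2})\lambda$ in the chosen units, and the smallest shift belongs to $|0,2\rangle$.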

Aside: Consider mixing with the non-degenerate states. By symmetry $\langle 1,0|\lambda x^2|2,0\rangle = 0$: the
perturbation does not mix $n_x = 0$ and $n_x = 1$ states, nor does it affect $n_y$ (see 2.3). Thus applying
the perturbation may induce a transition from $|0,2\rangle$ to $|2,2\rangle$, $|4,2\rangle$ etc. but not to odd $n_x$ or
to $n_y \neq 2$. This gives rise to selection rules.



4.3 Translational Symmetry and Conservation of Momentum 



Consider a transformation operator in 1 dimension, $D$, which acts on the coordinates of a system as
a displacement $D[f(x)] = f(x+l)$. The eigenfunctions of $D$ satisfy $D|\phi(x)\rangle = d|\phi(x)\rangle = |\phi(x+l)\rangle$.
The general solutions to this equation are $\phi(x) = e^{ikx}u(x)$ where $u(x)$ satisfies $u(x) = u(x+l)$
and $k$ is complex.

This kind of translational symmetry exists when we have a crystal structure. Now consider a 1D
closed loop of $N$ atoms. Uniqueness of the wavefunction requires that $\phi(x) = \phi(x + Nl)$, i.e.
$e^{ikx} = e^{ik(x+Nl)}$. Thus possible wavefunctions must have real $k$ and the form

$$\phi(x) = e^{2\pi i n x/Nl}u(x); \qquad k = 2\pi n/Nl$$



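The momentum of this state is given by:

$$\langle p \rangle = \int \phi^*(x)\left(-i\hbar\frac{d}{dx}\right)\phi(x)\,dx = \int \phi^*(x)\left[\frac{2\pi\hbar n}{Nl} - i\hbar\frac{u'(x)}{u(x)}\right]\phi(x)\,dx$$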



The first term on the RHS gives the familiar $\hbar k$, which we associate with the momentum. If $u(x)$
has some definite parity, then u'(x) will have opposite parity and the second term will be the 
integral of an odd function (i.e. zero by symmetry). Thus k is a quantum number associated with 
translational symmetry, which in turn has an operator D which commutes with the Hamiltonian 
and is thus a constant of the motion. Translational symmetry is associated with conservation of 
momentum. 

For this system the TISE, with the atom described by a potential $V(x)$ and a particular value of $k$,
can be written

$$H_k u_k(x) = \left[\frac{\hbar^2}{2m}\left(k - i\frac{d}{dx}\right)^2 + V(x)\right]u_k(x) = E_k u_k(x)$$

Since the phase has been eliminated, we simply have a particle in a fixed volume ($u_k(x) = u_k(x+l)$),
which means a series of discrete energy levels (bands). Thus all states can be labelled by $k$ and a
band index $n$.

We can write the semiclassical group velocity of the wavefunction as

$$v_g = \frac{d\omega}{dk} = \frac{1}{\hbar}\frac{dE}{dk}$$

using $E = \hbar\omega$. A formal proof using the velocity operator gives the same result for the velocity.
Assuming that $E$ does vary with $k$, this means we have a time-independent state which nevertheless
has a permanent, non-zero velocity through the lattice.



4.4 Application - electron in a crystalline solid 

The above is the 1D statement of Bloch's Theorem, the basis of the study of electrons in solids. If we
imagine applying an electric field ($\mathcal{E}$) in the x-direction, then the rate at which work is done is:

$$-e\mathcal{E}v_g = \frac{dE}{dt} = \frac{dE}{dk}\frac{dk}{dt}$$

Using the expression for $v_g$ we find that the rate of change of $\hbar k$ is proportional to the external
force, rather like Newton's second law.

$$-e\mathcal{E} = F = \hbar\frac{dk}{dt}$$

If we now consider acceleration:

$$\frac{dv_g}{dt} = \frac{dv_g}{dk}\frac{dk}{dt} = \frac{1}{\hbar^2}\frac{d^2E}{dk^2}F$$

we find a quantity $\hbar^2/\frac{d^2E}{dk^2}$ which is known as the effective mass, relating external force to
acceleration in a solid, and allowing us to avoid further consideration of the effect of the lattice.
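The group velocity and effective mass can be estimated numerically from any dispersion relation $E(k)$ by finite differences. The sketch below assumes units with $\hbar = 1$ and uses a free-particle dispersion only as a sanity check (for which the effective mass should equal the real mass); the function names and step size are illustrative.

```python
HBAR = 1.0  # assumed units in which hbar = 1

def group_velocity(energy, k, dk=1e-3):
    """v_g = (1/hbar) dE/dk, estimated with a central difference."""
    return (energy(k + dk) - energy(k - dk)) / (2.0 * dk * HBAR)

def effective_mass(energy, k, dk=1e-3):
    """m* = hbar^2 / (d^2E/dk^2), estimated with a central second difference."""
    curvature = (energy(k + dk) - 2.0 * energy(k) + energy(k - dk)) / dk**2
    return HBAR**2 / curvature

def free_particle(k, m=2.0):
    """Free-particle dispersion E = hbar^2 k^2 / 2m, used here only as a check."""
    return HBAR**2 * k**2 / (2.0 * m)

v = group_velocity(free_particle, 0.3)       # should be close to hbar k / m
m_star = effective_mass(free_particle, 0.3)  # should be close to m
```

For a real band the curvature, and hence the effective mass, varies with $k$ and can even be negative near the top of a band.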



4.5 The Kronig-Penney Model 




Figure 2: The Kronig-Penney potential and a Bloch function 

In 4.3 $u(x)$ is still completely general. The Kronig-Penney model considers a periodically repeating
square potential defined in one cell by $V(x) = 0$ $(0 < x < b)$; $V(x) = V_0$ $(b < x < l)$, so that we can
solve for $u(x)$ in one cell. Like the finite square well, this is a tedious boundary condition problem
where matching value and slope of the wavefunction at the potential edge gives a 4x4 matrix to
diagonalise. The details are given in wikipedia(!) and lead to an equation the LHS of which is
drawn below:

$$\cos k_1 b\,\cos k_2(l-b) - \frac{k_1^2 + k_2^2}{2k_1k_2}\sin k_1 b\,\sin k_2(l-b) = \cos kl$$

where $k_1 = \sqrt{2mE}/\hbar$ and $k_2 = \sqrt{2m(E - V_0)}/\hbar$ are the appropriate free particle wavevectors; thus
for $E < V_0$, $k_2$ is imaginary. As the figure shows, for every $k$ there are multiple solutions, which lie in
certain "bands" of energy; no solutions exist at other energies.



Figure 3: Graph of function arising from multiple square-well problem (vertical axis: left hand side of the equation, with the limits +1 and -1 marked; horizontal axis: $E/V_0$). Allowed energy solutions
exist only where $|\cos kl| \le 1$.



The key point about this equation is that it cannot be solved for certain values of $E$, around
$k_1 b = n\pi$. A plot of the left hand side of the equation against $E/V_0$ illustrates this: solutions
for some value of $k$ can be found only in the shaded regions of $E$. Moreover each shaded region
contains $N$ allowed $k = 2\pi n/Nl$ values. Thus if each atom contributes two electrons the lower
'valence' band will be filled (one of each spin in each state) and the upper 'conduction' band will
be empty. To get an electron to move (change to a different $k$-state) requires a lot of energy, so
this represents an insulator.

In the limit of $V_0 = 0$, we get $k = k_1 = k_2 = \sqrt{2mE}/\hbar$, the free electron result, while for very large
$V_0 \gg E$ solutions are possible only for values of $E$ which satisfy $\sin(k_1 b) \approx 0$, i.e. the square
well.
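The band structure can be explored by evaluating the left hand side of the Kronig-Penney condition, $\cos k_1 b\,\cos k_2(l-b) - \frac{k_1^2+k_2^2}{2k_1k_2}\sin k_1 b\,\sin k_2(l-b)$, and testing whether it lies between -1 and +1. This is only a sketch: it assumes units with $\hbar = m = 1$, uses complex arithmetic so that the same expression covers $E < V_0$ (imaginary $k_2$), and the parameter values are arbitrary illustrations. The expression is singular at exactly $E = 0$ and $E = V_0$, so those points are avoided.

```python
import cmath
import math

def kronig_penney_lhs(E, V0, b, l, hbar=1.0, m=1.0):
    """Left hand side of the Kronig-Penney condition; a band exists where |LHS| <= 1."""
    k1 = cmath.sqrt(2.0 * m * E) / hbar
    k2 = cmath.sqrt(2.0 * m * (E - V0)) / hbar  # imaginary when E < V0
    well, barrier = k1 * b, k2 * (l - b)
    value = (cmath.cos(well) * cmath.cos(barrier)
             - (k1**2 + k2**2) / (2.0 * k1 * k2) * cmath.sin(well) * cmath.sin(barrier))
    return value.real  # the imaginary part vanishes up to rounding

def is_allowed(E, V0, b, l):
    """True if some real k satisfies cos(kl) = LHS at this energy."""
    return abs(kronig_penney_lhs(E, V0, b, l)) <= 1.0

# Illustrative parameters (assumptions): V0 = 10, well width b = 0.5, period l = 1.
low_energy_allowed = is_allowed(1.0, 10.0, 0.5, 1.0)
higher_energy_allowed = is_allowed(2.0 * math.pi**2, 10.0, 0.5, 1.0)
```

Scanning `is_allowed` over a fine grid of energies gives the shaded regions of the figure for a chosen set of parameters.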

The wavefunction is a complex exponential of $k_1 x$ or $k_2 x$, depending on whether it is in a well
or not. It is not an eigenfunction of the momentum operator. Thus although $\hbar k$ looks like a
momentum, it isn't the eigenvalue of the momentum operator. It is called "crystal momentum"
and along with the "effective mass" gives a pair of quantities with which we can apply Newtonian
dynamics thinking to a crystal, ignoring the effects of the lattice.

In three dimensions, the topology of the bands becomes much more complicated: this is a topic 
for solid state physics. 

4.6 Radioactive decay and imaginary potentials 

If the number of particles in a given state is reduced in time, then the total intensity of that state
is reduced. Consider a particle moving in a region of imaginary potential $V(r) = -iV_0$. The
TDSE is:

$$i\hbar\frac{\partial}{\partial t}|\Psi,t\rangle = [H - iV_0]\,|\Psi,t\rangle$$

Assume that the state is a combination of eigenstates of the real
part of the Hamiltonian:

$$|\Psi,t\rangle = \sum_n c_n(t)\exp(-iE_nt/\hbar)\,|\phi_n\rangle; \qquad \text{where } H|\phi_n\rangle = E_n|\phi_n\rangle$$

Following the same analysis as for the TDSE, premultiplying by $\langle\phi_m|$, and for constant $V_0$, $V_{mn} = \delta_{mn}V_0$,
we obtain:

$$i\hbar\dot{c}_m = -iV_0c_m \quad\Rightarrow\quad |c_m(t)|^2 = |c_m(0)|^2e^{-2V_0t/\hbar}$$

Thus the probability amplitude of the state decreases in time. An imaginary potential can be
used to represent destruction of particles, either by absorption (in a scattering process, perhaps)
or by radioactive decay. Obviously the ket is not a full description of the system, since that should
include information about the decay products. The lifetime of the state is $\tau = \hbar/2V_0$.

Notice that $-iV_0$ is not a Hermitian operator, and so it is not possible to perform a single
measurement of half life.






5 Time-dependence

5.1 Time-dependent Hamiltonians

Recall that for a system described by a Hamiltonian, $H_0$, which is time-independent, the most
general state of the system can be described by a wavefunction $|\Psi,t\rangle$ which can be expanded in
the energy eigenbasis $\{|n\rangle\}$ as follows:

$$|\Psi,t\rangle = \sum_n c_n\exp(-iE_nt/\hbar)\,|n\rangle$$

where the coefficients, $c_n$, are time-independent, and $E_n$ denotes the eigenvalue corresponding to
the energy eigenstate $|n\rangle$ of $H_0$.

When we generalise to the case where the Hamiltonian is of the form

$$H = H_0 + V(t)$$

we can again expand in $|n\rangle$, the time-independent eigenbasis of $H_0$

$$|\Psi,t\rangle = \sum_n c_n(t)\exp(-iE_nt/\hbar)\,|n\rangle$$

but the coefficients, $c_n$, will now in general be time-dependent.

The wavefunction satisfies the time-dependent Schrodinger equation

$$i\hbar\frac{\partial}{\partial t}|\Psi,t\rangle = H|\Psi,t\rangle$$

so that we can substitute the expansion of $|\Psi,t\rangle$ to determine the equations satisfied by the
coefficients $c_n(t)$. Writing $E_n = \hbar\omega_n$ and denoting the time derivative of $c_n$ by $\dot{c}_n$ we obtain

$$i\hbar\sum_n(\dot{c}_n - i\omega_nc_n)\exp(-i\omega_nt)\,|n\rangle = \sum_n(c_n\hbar\omega_n + c_nV)\exp(-i\omega_nt)\,|n\rangle$$

which simplifies immediately to give

$$\sum_n(i\hbar\dot{c}_n - c_nV)\exp(-i\omega_nt)\,|n\rangle = 0$$

We now premultiply this equation with another eigenstate of $H_0$, $\langle m|$, to give

$$i\hbar\dot{c}_m\exp(-i\omega_mt) - \sum_n c_nV_{mn}\exp(-i\omega_nt) = 0$$

giving the following set of coupled, first-order differential equations for the coefficients:

$$i\hbar\dot{c}_m = \sum_n c_nV_{mn}\exp(i\omega_{mn}t)$$

where $\omega_{mn} = \omega_m - \omega_n$ and $V_{mn} = \langle m|V|n\rangle$.

This tells us how the coefficient $c_m$ varies with time; $|c_m|^2$ is the probability that a measurement will
show the system to be in the $m^{th}$ eigenstate. The result is exact, but not terribly useful because we must,
in general, solve an infinite set of coupled differential equations.

It is worth dwelling on the importance of the quantity $V_{mn}$. This 'matrix element' is an integral
which tells us how much the potential $V$ mixes states $|m\rangle$ and $|n\rangle$. If it is zero (which it often is,
by symmetry) then $V$ cannot induce a transition between states $|m\rangle$ and $|n\rangle$.






5.2 Time-dependent Perturbation Theory 

Consider the Hamiltonian

$$H = H_0 + V(t)$$

where the time dependent part is small. We can write the time dependent coefficients $c_n$ as

$$c_n(t) = c_n(0) + \Delta c_n(t)$$

where $c_n(0)$ is the value of $c_n$ at $t=0$. We substitute in the equation for $\dot{c}_m$ derived above to give

$$\dot{c}_m(t) = (i\hbar)^{-1}\sum_n[c_n(0) + \Delta c_n(t)]\,V_{mn}\exp(i\omega_{mn}t)$$

We can assume that for a perturbation $c_n(0) \gg \Delta c_n(t)$, and ignore the second term. This allows
us to obtain the coefficients $c_m(t)$ by integrating the first-order differential equation to give:

$$c_m(t) = c_m(0) + (i\hbar)^{-1}\sum_n c_n(0)\int_0^t V_{mn}\exp(i\omega_{mn}t')\,dt'$$

In the special case where the system is known to be in an eigenstate of $H_0$, say $|k\rangle$, at $t = 0$, then
$c_k(0) = 1$ and all other $c_m(0) = 0$, $m \neq k$, giving

$$c_m(t) = (i\hbar)^{-1}\int_0^t V_{mk}\exp(i\omega_{mk}t')\,dt'$$

Thus a system starting in a known eigenstate of the unperturbed system may transform to a
different eigenstate through the action of the perturbing potential. Notice that $c_m(t)$ is an integral
over time: if we wait a long time, the transition may become more likely.

The probability of finding the system at a later time, $t$, in the state $|m\rangle$ where $m \neq k$ is given by

$$p_m(t) = |c_m(t)|^2$$

Since we have assumed a small perturbation, this result is only reliable if $p_m(t) \ll 1$. "Small" here
applies to both $V_{mk}$ and its integral over time.



5.3 Time-independent Perturbations 

The results obtained in the last section can also be applied to the case where the perturbation, 
V, is actually independent of time (strictly, 'switched on' at t=0). 

Again, starting the system in eigenstate $|k\rangle$ of $H_0$ we obtain,

$$c_m(t) = \Delta c_m(t) = (i\hbar)^{-1}V_{mk}\int_0^t\exp(i\omega_{mk}t')\,dt' = \frac{V_{mk}}{\hbar\omega_{mk}}[1 - \exp(i\omega_{mk}t)]$$

for $m \neq k$, giving for the transition probability

$$p_m(t) = |\Delta c_m(t)|^2 = \frac{|V_{mk}|^2}{\hbar^2}\frac{\sin^2(\omega_{mk}t/2)}{(\omega_{mk}/2)^2}$$

For sufficiently large values of $t$, the function

$$f(t,\omega_{mk}) = \frac{\sin^2(\omega_{mk}t/2)}{(\omega_{mk}/2)^2}$$

consists essentially of a large peak, centred on $\omega_{mk} = 0$, of height $t^2$ and width $4\pi/t$, as
indicated in Fig. 4. Thus there is only a significant transition probability if $E_m \approx E_k$. That is, if
$|\omega_{mk}| < 2\pi/t$.

Figure 4: Transition probability as a function of applied harmonic perturbation frequency (horizontal axis marked at $\omega_{nl}$, $\omega_{nl} \pm 2\pi/t$ and $\omega_{nl} \pm 4\pi/t$)

Note that we are assuming that the system was prepared in some eigenstate of $H_0$ which is not
an eigenstate of $V$: if it were, then the matrix element $V_{mk}$ would be zero and $p_m(t) = 0$. Thus
although the analysis treats the perturbation as time independent, it is applied to cases where the
perturbation is switched on at $t = 0$. Moreover only perturbations which are incompatible with
the Hamiltonian can induce transitions.
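The quality of the first-order result can be checked by integrating the exact coupled equations $i\hbar\dot{c}_m = \sum_n c_nV_{mn}\exp(i\omega_{mn}t)$ numerically for a two-state system. The sketch below makes several simplifying assumptions that are not in the notes: only two states, $V_{11} = V_{22} = 0$, a real constant $V_{12} = V_{21} = v$, units with $\hbar = 1$, and a hand-written fourth-order Runge-Kutta step.

```python
import cmath
import math

def integrate_two_state(v, omega21, t_final, steps=2000, hbar=1.0):
    """Integrate the exact two-state equations from c_1 = 1, c_2 = 0 with RK4."""
    def deriv(t, c1, c2):
        # omega_12 = -omega_21, so the two equations carry opposite phases.
        dc1 = v * c2 * cmath.exp(-1j * omega21 * t) / (1j * hbar)
        dc2 = v * c1 * cmath.exp(1j * omega21 * t) / (1j * hbar)
        return dc1, dc2

    c1, c2 = 1.0 + 0.0j, 0.0 + 0.0j
    dt = t_final / steps
    for i in range(steps):
        t = i * dt
        k1 = deriv(t, c1, c2)
        k2 = deriv(t + dt / 2, c1 + dt / 2 * k1[0], c2 + dt / 2 * k1[1])
        k3 = deriv(t + dt / 2, c1 + dt / 2 * k2[0], c2 + dt / 2 * k2[1])
        k4 = deriv(t + dt, c1 + dt * k3[0], c2 + dt * k3[1])
        c1 += dt / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        c2 += dt / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
    return c1, c2

def first_order_probability(v, omega21, t, hbar=1.0):
    """p_m(t) = |V_mk|^2 sin^2(omega t / 2) / (hbar^2 (omega / 2)^2) from the text."""
    return v**2 * math.sin(omega21 * t / 2) ** 2 / (hbar**2 * (omega21 / 2) ** 2)

c1, c2 = integrate_two_state(v=0.05, omega21=1.0, t_final=2.0)
p_exact = abs(c2) ** 2
p_first_order = first_order_probability(0.05, 1.0, 2.0)
```

For a weak coupling such as the one chosen here the two probabilities agree closely; increasing `v` relative to $\hbar\omega_{21}$ makes the first-order estimate progressively worse.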

5.4 Harmonic Perturbation 

This is generally useful since by Fourier analysis we can decompose any periodic perturbation into 
harmonic components. 

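Let the perturbing potential be $V(r,t) = V(r)\cos\omega t$.

If the initial state at $t = 0$ is $k$, and the final state $m$, then

$$c_m \approx \frac{1}{i\hbar}V_{mk}\int_0^t e^{i\omega_{mk}t'}\,\frac{e^{i\omega t'} + e^{-i\omega t'}}{2}\,dt' = \frac{V_{mk}}{2\hbar}\left(\frac{1 - e^{i(\omega_{mk}-\omega)t}}{\omega_{mk} - \omega} + \frac{1 - e^{i(\omega_{mk}+\omega)t}}{\omega_{mk} + \omega}\right)$$

where $V_{mk}$ is the time independent part of the matrix element $\langle m|V|k\rangle$. This function is dominated
by the first term in the region around $\omega_{mk} = \omega$, so we can consider only the first term to obtain
an estimate for the transition probability:

$$|c_m(t)|^2 \approx \frac{|V_{mk}|^2\sin^2[(\omega_{mk}-\omega)t/2]}{\hbar^2(\omega_{mk}-\omega)^2} = \frac{1}{4\hbar^2}|V_{mk}|^2f(t,\omega_{mk}-\omega)$$

where the function $f$ is the same as we encountered earlier. Thus an external perturbation at a
given frequency most strongly induces transitions between energy levels separated by $\hbar\omega$.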

This is another manifestation of an uncertainty principle. If the potential is electromagnetic, the
most probable transition is the absorption of a $\hbar\omega_{mk}$ photon as the system changes energy by
$\hbar\omega_{mk}$. But if the transition happens very fast, the peak is broad and the photon could have a wide
range of energies; contrariwise, if the transition occurs after a long time the photon frequency is
well defined: $\Delta E\Delta t \ge \hbar/2$. This uncertainty gives rise to the 'natural linewidth' of a particular
transition, and causes a limit to the accuracy of certain experiments. There is a slight difference
from the Heisenberg Uncertainty in non-relativistic quantum mechanics because time is not an
operator so one cannot define the commutator of time with the Hamiltonian.

Note the extraordinary result that the transition probability at small times is $(|V_{mk}|^2/4\hbar^2)\,t^2$.
Consider what happens if the state is measured frequently compared to if measurements are made
infrequently: frequent measurement tends to inhibit the transition!
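The inhibition of the transition by frequent measurement can be made concrete with a small calculation. The sketch below assumes (this is an idealisation, not something stated in the notes) that each measurement finding the system in the initial state restarts the short-time evolution, so that the survival probabilities of successive intervals multiply. It uses the short-time result $(|V_{mk}|^2/4\hbar^2)t^2$, units with $\hbar = 1$, and is only meaningful while the transition probability per interval is much less than 1.

```python
def survival_after_measurements(total_time, n_measurements, v=1.0, hbar=1.0):
    """Probability of still being in the initial state after n equally spaced measurements."""
    dt = total_time / n_measurements
    # Short-time transition probability for one interval (valid only when << 1).
    p_transition = (v**2 / (4.0 * hbar**2)) * dt**2
    return (1.0 - p_transition) ** n_measurements

one_measurement = survival_after_measurements(1.0, 1)
ten_measurements = survival_after_measurements(1.0, 10)
hundred_measurements = survival_after_measurements(1.0, 100)
```

Because the probability per interval scales as $t^2$, splitting a fixed total time into more intervals pushes the survival probability towards 1.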



5.5 Transitions to a group of states 

We are often interested in the situation where transitions take place not to a single final state but 
to a group, G, of final states with energy in some range about the initial state energy 

$$E_k - \Delta E < E_m < E_k + \Delta E$$

Then the total transition probability is obtained by summing the contributions of all the final
states. The number of final states in the interval between $E_m$ and $E_m + dE_m$ is $g(E_m)\,dE_m$, where
the function $g(E_m)$ is known as the density of final states. The total transition probability for
transitions to $G$ is then given by

$$P_G(t) = \frac{1}{\hbar^2}\int_{E_k-\Delta E}^{E_k+\Delta E}|V_{mk}|^2f(t,\omega_{mk})\,g(E_m)\,dE_m$$

For sufficiently large $t$, and $\Delta E \gg 2\pi\hbar/t$, we observe that essentially the only contributions to
the integral come from the energy range corresponding to the narrow central peak of the function
$f(t,\omega_{mk})$. Within this range we can neglect the variation of $g(E_m)$ and $V_{mk}$, which can therefore
be taken out of the integral to give

$$P_G(t) = \frac{|V_{mk}|^2}{\hbar^2}\,g(E_k)\int_{E_k-\Delta E}^{E_k+\Delta E}f(t,\omega_{mk})\,dE_m$$

Furthermore, we can extend the limits on the integration to $\pm\infty$. Noting that $dE_m = \hbar\,d\omega_{mk}$ and
using the result that

$$\int_{-\infty}^{\infty}\frac{\sin^2x}{x^2}\,dx = \pi$$

we obtain for the first-order transition probability

$$P_G(t) = \frac{2\pi t}{\hbar}\left[|V_{mk}|^2g(E_m)\right]_{E_m=E_k}$$

The transition rate, $R$, is just the derivative of this with respect to $t$ and is thus given by the
so-called Fermi Golden Rule:

$$R = \frac{2\pi}{\hbar}\left[|V_{mk}|^2g(E_m)\right]_{E_m=E_k}$$



The Fermi Golden Rule is probably the single most widely used result in quantum mechanics. The
numerical prefactor depends on the choice of perturbing potential, but the $|V_{mk}|^2g(E_m)$ term appears for
any applied perturbation. Be careful about the density of energy states: one sometimes encounters
density of frequency states (which differs by a factor of $\hbar$) or of wavevector states.
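The step from the integral over the peak to the factor $2\pi t$ can be checked numerically. The sketch below integrates $f(t,\omega) = \sin^2(\omega t/2)/(\omega/2)^2$ over a wide but finite range of $\omega$ with a simple midpoint rule; the cutoff and number of points are arbitrary choices, and the small shortfall relative to $2\pi t$ comes from the neglected tails.

```python
import math

def f(t, omega):
    """The peaked function f(t, omega) = sin^2(omega t / 2) / (omega / 2)^2."""
    if abs(omega) < 1e-12:
        return t * t  # limiting value at the centre of the peak
    return math.sin(omega * t / 2.0) ** 2 / (omega / 2.0) ** 2

def integrate_f(t, omega_max=400.0, n=80000):
    """Midpoint-rule estimate of the integral of f over [-omega_max, omega_max]."""
    h = 2.0 * omega_max / n
    total = 0.0
    for i in range(n):
        omega = -omega_max + (i + 0.5) * h
        total += f(t, omega)
    return total * h

t = 10.0
ratio = integrate_f(t) / (2.0 * math.pi * t)  # should be close to 1
```

Since the integral grows linearly with $t$, its time derivative is constant, which is why the Golden Rule gives a constant transition rate.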

It may appear that we need to know the density of final states, $g(E_m)$, but this is not always true.
In cases where $|V_{mk}| = 0$ transitions are forbidden, and in some cases we can deduce $g(E_m)$ from
the relative rates of related transitions.



5.6 Example of Golden rule - beta decay 

A nucleus decays via the reaction $n \rightarrow p\,e^-\,\bar{\nu}$ to form an electron and antineutrino, releasing
energy $E_0$.

The simplest form for the matrix element describing nuclear $\beta$-decay is given by the so-called Fermi
ansatz $V_{mk} = G_FM/\Omega$ where $\Omega$ is the normalisation volume for the wavefunctions, $|M|^2 \approx 1$ is
the wavefunction overlap between initial and final nuclear states and $G_F$ is a constant.

We can work in the COM reference frame, so the kinetic energy of the nucleus is zero. Momentum
is conserved, so the final state has nuclear, electron and neutrino momentum $\mathbf{P} + \mathbf{p} + \mathbf{q} = 0$,
while the energy released goes into the electron and the neutrino, the latter of which for simplicity we treat as
massless: $E_0 = E_e + qc$. The proton and neutron are heavy compared with the electron and
neutrino. Given that momentum must be conserved, the kinetic energy must be concentrated in
the lighter particles.

The density of final states for the electron is given by the phase space volume

$$\frac{d^3p\,d^3r}{(2\pi\hbar)^3}$$

with a similar expression for the neutrino. The number of states in a volume of phase space is given by
the number of electron states, times the number of neutrino states, provided energy is conserved:



Using the relativistic relation $E^2 = p^2c^2 + m^2c^4$ implies $\frac{dp}{dE} = \frac{E}{pc^2}$,
the normalisation volume is just $\int d^3r = \Omega$, and rotational invariance gives $d^3p = 4\pi p^2\,dp$.
All of which simplifies the integral to



where $E_e$ is the electron energy and $E_\nu$ is the neutrino energy. What can actually be measured is
the electron energy, so we integrate over the neutrino energies,



This is the distribution of electron energies from beta decay: the rate of emission of electrons at
a particular energy is given by the Golden Rule



The figure shows the simplest case of beta-emission: neutron
decay. Conservation laws tell us that the electron energy
must lie between its rest mass (0.51MeV) and the total
energy available (0.7823MeV). But the entire shape can be
deduced from geometry.






6 Two state systems 
6.1 Time Dependence 

The exact expression for the time dependence of a system with $N$ states required a set of $N$
simultaneous differential equations. One case where we can solve this problem exactly is when we
have a small number of states. Consider a system which requires only two basis states. Say we
prepare it in initial state $|1\rangle$ and we want to know how long it will take to go to the other state
$|2\rangle$. From section 5, we have two coupled equations in the time dependent $c_1$ and $c_2$:

$$i\hbar\dot{c}_1 = V_{11}c_1 + V_{12}c_2e^{i\omega_{12}t}$$
$$i\hbar\dot{c}_2 = V_{22}c_2 + V_{21}c_1e^{i\omega_{21}t}$$

where $c_1(0) = 1$ and $c_2(0) = 0$.

If the change is slow, we can use first order time-dependent perturbation theory. We thus replace
the $c_n(t)$ by $c_n(0)$, and integrate, whence:

$$c_1 \approx \exp(-iV_{11}t/\hbar), \qquad |c_1|^2 \approx 1$$

$$c_2 \approx \frac{1}{i\hbar}\int_0^t V_{21}e^{i\omega_{21}t'}\,dt'$$

including the constant of integration for $c_1(0) = 1$.
6.2 Notes 

• The 'Matrix element' $V_{21}$ determines whether there is a transition from an initial state 1 to
a final state 2 even if $V$ is independent of time. It also determines the rate of the transition.

• If the states $|1\rangle$ and $|2\rangle$ are eigenstates of the perturbation $V$ then $V_{21} = V_{12} = 0$ and no
transition occurs.

• Over a long period of time, the system will oscillate between the two states. 

• Perturbation theory, in essence, ignores the third-order possibility of ending up in state 2 
via |1) -> |2) -> |1) -> |2) 

• The mathematics is the same as for two coupled pendula, where the energy moves back and 
forth between the two bobs. 

• The states can represent anything, and oscillation will occur whenever there are off diagonal 
terms in the matrix. 

• Examples: (see Feynman III Ch.9-11) Nitrogen atom in ammonia, electron in $H_2^+$, pion
exchange, benzene, electron spins, photon polarisation, neutrino oscillations, neutral kaons.



6.3 Example: Oscillation in a fully mixing two state system 



Consider the expectation value of a quantity $S$ in a system which has two non-degenerate energy
eigenstates $|1\rangle$ and $|2\rangle$, and where the Hermitian operator $S$ is defined by $S|1\rangle = |2\rangle$, $S|2\rangle = |1\rangle$.

The general state can be written:

$$|\phi\rangle = c_1\exp(-iE_1t/\hbar)|1\rangle + c_2\exp(-iE_2t/\hbar)|2\rangle$$

If we assume real $c_1$, $c_2$ it follows that the expectation value $\langle S\rangle$ will be:

$$\langle S\rangle = \langle\phi|S|\phi\rangle = \left[c_1e^{iE_1t/\hbar}\langle 1| + c_2e^{iE_2t/\hbar}\langle 2|\right]\left[c_1e^{-iE_1t/\hbar}|2\rangle + c_2e^{-iE_2t/\hbar}|1\rangle\right]$$
$$= c_1c_2\left[e^{i\omega_{21}t} + e^{-i\omega_{21}t}\right] = 2c_1c_2\cos(\omega_{21}t)$$

Thus the expectation value of $S$ oscillates in time at frequency $\omega_{21} = (E_2 - E_1)/\hbar$. This arises
because $S$ is not compatible with the Hamiltonian, and hence does not define a constant of the
motion.



6.4 Neutrino Oscillations 

Neutrino oscillation is a phenomenon where a specific flavour of neutrino (electron, muon or tau)
is later measured to have a different flavour. The probability of measuring a particular flavour varies
periodically. Neutrinos are created by a radioactive decay in one of three flavour eigenstates
$|f_1\rangle$, $|f_2\rangle$, $|f_3\rangle$ (electron, muon, tauon). However, these are not the eigenstates of energy with a
definite mass, $|m_1\rangle$, $|m_2\rangle$, $|m_3\rangle$. We can expand the flavour eigenstate using the energy eigenstates
as a basis:

$$|f_i\rangle = \sum_j\langle m_j|f_i\rangle\,|m_j\rangle$$

The energy eigenstates show how the wavefunctions behave in time, $m_j(t) = m_j(0)\exp(i\omega_jt)$, where
$\omega_j = m_jc^2/\hbar$ and $\omega_{ij} = (m_i - m_j)c^2/\hbar$. Consider an electron neutrino produced by a fusion reaction
in the sun, $|\Phi(t=0)\rangle = |f_1\rangle$; its wavefunction then varies as:

$$|\Phi(t)\rangle = \sum_j|m_j\rangle\langle m_j|f_1\rangle\exp(i\omega_jt)$$

For real neutrinos, the $\langle m_j|f_i\rangle$ matrix has non-zero, possibly even complex elements everywhere,
but here for simplicity we suppose that it is

$$\begin{pmatrix} a & c & 0 \\ -c & a & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

with $a$ and $c$ real, time independent and $a^2 + c^2 = 1$ for normalisation. Our electron neutrino
then evolves as $|\Phi(t)\rangle = a\exp(i\omega_1t)|m_1\rangle + c\exp(i\omega_2t)|m_2\rangle$, so the probability that some time
later it is still an electron neutrino is

$$|\langle f_1|\Phi(t)\rangle|^2 = |a^2\exp(i\omega_1t) + c^2\exp(i\omega_2t)|^2$$
$$= a^4 + c^4 + a^2c^2\left[\exp(i\omega_{21}t) + \exp(-i\omega_{21}t)\right]$$
$$= 1 - 4a^2c^2\sin^2(\omega_{21}t/2)$$

which is less than 1: it can somehow "turn into" a muon neutrino. Often, one writes $a = \sin\theta$, in
which case $4a^2c^2 = \sin^22\theta$. $\theta$ is referred to as a "mixing angle".

If $a = c = \sqrt{1/2}$, then with a frequency governed by the difference in masses, the electron neutrino
turns completely into a muon neutrino, then back again. With smaller $c$, there's always some
chance that it will still be an electron neutrino. In reality, it is also possible to oscillate into a
tau neutrino. This underlies the "solar neutrino problem". Detection of solar neutrinos was the
subject of the 2002 Nobel prize. Similar oscillation occurs in the kaon system due to a
symmetry-breaking effect called "CP violation", subject of the 2008 Nobel prize. Here one of the states is
subject to radioactive decay, so a particle not only "turns into" something else, it also disappears
when it does so!
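The two-flavour survival probability $1 - 4a^2c^2\sin^2(\omega_{21}t/2)$ can be checked against the amplitude it was derived from. The sketch below is only an illustration of the simplified two-state mixing described above: the mixing angle, frequencies and times are arbitrary numbers in arbitrary units, not physical neutrino parameters.

```python
import cmath
import math

def survival_amplitude(a, c, omega1, omega2, t):
    """<f_1|Phi(t)> = a^2 exp(i omega_1 t) + c^2 exp(i omega_2 t) for real a, c."""
    return a * a * cmath.exp(1j * omega1 * t) + c * c * cmath.exp(1j * omega2 * t)

def flavour_survival_probability(a, c, omega21, t):
    """Closed form 1 - 4 a^2 c^2 sin^2(omega_21 t / 2)."""
    return 1.0 - 4.0 * a * a * c * c * math.sin(omega21 * t / 2.0) ** 2

theta = 0.6                       # illustrative mixing angle
a, c = math.sin(theta), math.cos(theta)
omega1, omega2, t = 0.3, 1.1, 2.5
from_amplitude = abs(survival_amplitude(a, c, omega1, omega2, t)) ** 2
from_formula = flavour_survival_probability(a, c, omega2 - omega1, t)
```

With maximal mixing ($a = c = \sqrt{1/2}$) the closed form reaches zero at $\omega_{21}t = \pi$, which is the complete conversion described above.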

6.5 Strong force - Two state system or degenerate perturbation 

The fundamental forces can be thought of as manifestations of two state systems. Consider a
system comprising a proton and a neutron. The proton can decay into a neutron plus a pion,
while the neutron can absorb the pion and become a proton. We can think of the system as
two neutrons and a pion: the pion having two degenerate states $|a\rangle$ and $|b\rangle$ depending on which
neutron it is located at. The off-diagonal terms are now $\langle a|V_a|b\rangle$, where $V_a$ is the potential energy of
the pion due to neutron $a$. The two state analysis shows that we can think of the pion hopping
back and forth between the neutrons (the pion exchange mechanism). Or we can treat the system
by degenerate perturbation theory and diagonalise the 2x2 matrix to find energies $V_{aa} \pm V_{ab}$. The
ground state has a binding energy of $|V_{ab}|$.

Note that $V_{ab}$ involves the overlap between the state with the pion on one site and the state with
the pion on the other site. Obviously this depends on the separation ($R$), and so there is a force
between the neutrons $dE_g/dR$. As the nucleons move apart, the force depends on the tails of the
wavefunctions, which in turn are exponentially dependent on the pion mass. Thus the strength
of the strong force falls off exponentially with distance.

Note also that we have described the basis states of our two state system as 'a proton and a 
neutron', but the actual ground state is a mixture of the two. When interacting via the strong 
force, the nucleons lose their well-defined identity. 

This picture of forces arising from exchange of 'virtual' particles (the pion is not observed as a 
free particle here) is the standard way of thinking about fundamental forces - the electromagnetic 
force involves 'exchange of virtual photons', the gravitational force 'gravitons' etc. These forces 
are long ranged (not exponentially decaying) because the particles involved have zero mass. 

All of this is analogous to covalent bonding: 'exchange of electrons': and in each case there is 
still another level of understanding lurking beneath to define the potential V: QED (photons) for 
electron-ion bonding and QCD (gluons and quarks) for nucleon binding. 



7 The $H_2^+$ Ion and Bonding



As the simplest example of covalent bonding, we consider the hydrogen molecular ion. 

The hydrogen molecular ion is a system composed of
two protons and a single electron. It is useful to use centre
of mass (cm) coordinates by defining the relative position
vector, $\mathbf{R}$, of proton 2 with respect to proton 1, and the
position vector $\mathbf{r}$ of the electron relative to the centre of
mass of the two protons.

Figure: proton 1, proton 2 and the electron, with the origin O and the separation vector R marked.

The Schrodinger equation is 



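$$\left[-\frac{\hbar^2}{2\mu_{12}}\nabla_R^2 - \frac{\hbar^2}{2\mu_e}\nabla_r^2 - \frac{e^2}{(4\pi\epsilon_0)r_1} - \frac{e^2}{(4\pi\epsilon_0)r_2} + \frac{e^2}{(4\pi\epsilon_0)R}\right]\psi(\mathbf{r},\mathbf{R}) = E\,\psi(\mathbf{r},\mathbf{R})$$

where the reduced mass of the two-proton system is $\mu_{12} = M/2$, with $M$ the proton mass, and $\mu_e$
is the reduced mass of the electron/two-proton system:

$$\mu_e = \frac{m(2M)}{m + 2M} \simeq m$$

where $m$ is the electron mass.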



7.1 Born-Oppenheimer Approximation 



Because nuclei are a great deal more massive than electrons, the motion of the nuclei is much 
slower than that of the electrons. Thus the nuclear and electronic motions can be treated more or 
less independently and it is a good approximation to determine the electronic states at any value 
of R by treating the nuclei as fixed. This is the basis of the Born-Oppenheimer approximation. 

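In this approximation, the electron is described by an eigenfunction $U_j(\mathbf{r},\mathbf{R})$ satisfying the Schrodinger
equation

$$\left[-\frac{\hbar^2}{2\mu_e}\nabla_r^2 - \frac{e^2}{(4\pi\epsilon_0)r_1} - \frac{e^2}{(4\pi\epsilon_0)r_2} + \frac{e^2}{(4\pi\epsilon_0)R}\right]U_j(\mathbf{r},\mathbf{R}) = E_j(R)\,U_j(\mathbf{r},\mathbf{R})$$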

This is solved keeping R constant. For each R, a set of energy eigenvalues Ej(R) and eigenfunc- 
tions Uj(r, R) is found. The functions Uj(r, R) are known as molecular orbitals. 

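The full wavefunction for the $j^{th}$ energy level at given $\mathbf{R}$ is taken to be the simple product

$$\psi(\mathbf{r},\mathbf{R}) = F_j(\mathbf{R})\,U_j(\mathbf{r},\mathbf{R})$$

where $F_j(\mathbf{R})$ is a wavefunction describing the nuclear motion.

Substituting this form into the full Schrodinger equation and using the electronic equation yields

$$\left[-\frac{\hbar^2}{2\mu_{12}}\nabla_R^2 + E_j(R) - E\right]F_j(\mathbf{R})\,U_j(\mathbf{r},\mathbf{R}) = 0$$

A little vector calculus gives

$$\nabla_R^2\{F_j(\mathbf{R})\,U_j(\mathbf{r},\mathbf{R})\} = \nabla_R\cdot\{\nabla_R[F_j(\mathbf{R})\,U_j(\mathbf{r},\mathbf{R})]\}$$
$$= \nabla_R\cdot\{U_j(\mathbf{r},\mathbf{R})\,\nabla_RF_j(\mathbf{R}) + F_j(\mathbf{R})\,\nabla_RU_j(\mathbf{r},\mathbf{R})\}$$
$$= U_j(\mathbf{r},\mathbf{R})\,\nabla_R^2F_j(\mathbf{R}) + F_j(\mathbf{R})\,\nabla_R^2U_j(\mathbf{r},\mathbf{R}) + 2\,(\nabla_RU_j(\mathbf{r},\mathbf{R}))\cdot(\nabla_RF_j(\mathbf{R}))$$

Assuming that the variation of the molecular orbitals with inter-proton separation, $\mathbf{R}$, is weak,
we can neglect the terms involving $\nabla_RU_j(\mathbf{r},\mathbf{R})$ and $\nabla_R^2U_j(\mathbf{r},\mathbf{R})$, leaving a single-particle type
Schrodinger equation for the nuclear motion

$$\left[-\frac{\hbar^2}{2\mu_{12}}\nabla_R^2 + E_j(R) - E\right]F_j(\mathbf{R}) = 0$$

in which $E_j(R)$ plays the role of a potential. We will return to this later.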
7.2 The Electronic Ground State 

We now try to investigate the lowest electronic levels of H$_2^+$. First we look for symmetries, and
note that, since $\mathbf{r}_1 = \mathbf{r} + \mathbf{R}/2$ and $\mathbf{r}_2 = \mathbf{r} - \mathbf{R}/2$, the electronic Hamiltonian is invariant under the
parity operation $\mathbf{r} \to -\mathbf{r}$. If $\mathcal{P}$ denotes the parity operator, then

$$[\mathcal{P}, H] = 0$$

These are commuting operators, so they can have the same eigenfunctions. These eigenfunctions
are called gerade if the parity is even and ungerade if the parity is odd:

$$\mathcal{P}\, U_j^g(\mathbf{r}, \mathbf{R}) = U_j^g(\mathbf{r}, \mathbf{R}), \qquad \mathcal{P}\, U_j^u(\mathbf{r}, \mathbf{R}) = -U_j^u(\mathbf{r}, \mathbf{R})$$

Now think about wave functions. If R is large, the system separates into a hydrogen atom and
a proton (two degenerate states). The hydrogen atom has a large spacing between levels, so we
use degenerate perturbation theory with 1s levels only. Quite generally, this procedure of taking
linear combinations of atomic orbitals is known as the LCAO method. Note that this basis set is
normalised, but neither complete nor orthogonal.

Since there must be solutions which are eigenfunctions of the parity operator, we take normalised
linear combinations of gerade or ungerade symmetry of 1s orbitals:

$$\psi^g = [u_{1s}(r_1) + u_{1s}(r_2)]/\sqrt{2} \quad \text{and} \quad \psi^u = [u_{1s}(r_1) - u_{1s}(r_2)]/\sqrt{2}$$

We calculate the expectation value of the electronic Hamiltonian using these LCAO molecular
wavefunctions:

$$E^{g,u}(R) = \int \psi^{g,u\,*}(\mathbf{r}, \mathbf{R})\, H\, \psi^{g,u}(\mathbf{r}, \mathbf{R})\, d^3r = \langle u_{1s}(r_1)|H|u_{1s}(r_1)\rangle \pm \langle u_{1s}(r_1)|H|u_{1s}(r_2)\rangle$$

where + and - correspond to g and u respectively, giving $E^g(R)$ and $E^u(R)$ for each value of R.
The evaluation of the integrals is complicated, but the results have the form:

$$E^g(R) = E_{1s} + \frac{e^2}{(4\pi\epsilon_0)R}\; \frac{(1 + R/a_0)\exp(-2R/a_0) + [1 - (2/3)(R/a_0)^2]\exp(-R/a_0)}{1 + [1 + (R/a_0) + (1/3)(R/a_0)^2]\exp(-R/a_0)}$$

and

$$E^u(R) = E_{1s} + \frac{e^2}{(4\pi\epsilon_0)R}\; \frac{(1 + R/a_0)\exp(-2R/a_0) - [1 - (2/3)(R/a_0)^2]\exp(-R/a_0)}{1 - [1 + (R/a_0) + (1/3)(R/a_0)^2]\exp(-R/a_0)}$$

where $a_0$ is the Bohr radius and $E_{1s}$ is the ground-state energy of atomic hydrogen.

The two curves $E^g - E_{1s}$ and $E^u - E_{1s}$ are plotted as a function of R. Note that the curve which
corresponds to the symmetric (gerade) orbital exhibits a minimum at $R = R_0$, where $R_0/a_0 \approx 2.5$,
corresponding to $E^g - E_{1s} = -1.77$ eV. Since this is an upper bound on the ground-state energy,
this implies that there is a stable bound state, a molecular ion. The curve represents an effective
attraction between the two protons. By contrast, the curve corresponding to the ungerade orbital
has no minimum, so that a H$_2^+$ ion in this state will dissociate into a proton and a hydrogen
atom. If we think of the protons being attracted by the electron and repelled by each other, the
symmetrical state should be the more tightly bound because the electron spends more of its time
between the protons, where it attracts both of them. This is an example of covalent bonding.
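
The closed-form LCAO expressions for $E^g(R)$ and $E^u(R)$ above are easy to evaluate numerically. The following is a minimal sketch, not part of the original notes: it assumes the value 27.2114 eV for $e^2/(4\pi\epsilon_0 a_0)$, and the grid limits and function names are arbitrary choices. It scans $R/a_0$, locates the minimum of the gerade curve, and checks that the ungerade curve stays repulsive.

```python
import math

HARTREE_EV = 27.2114  # assumed value of e^2 / (4 pi eps0 a0) in eV

def lcao_energy(x, sign):
    """E^{g,u}(R) - E_1s in eV for x = R/a0; sign=+1 is gerade, -1 is ungerade."""
    overlap = (1.0 + x + x * x / 3.0) * math.exp(-x)
    numerator = ((1.0 + x) * math.exp(-2.0 * x)
                 + sign * (1.0 - 2.0 * x * x / 3.0) * math.exp(-x))
    return (HARTREE_EV / x) * numerator / (1.0 + sign * overlap)

# Uniform grid in R/a0 from 0.5 to 10.0 (limits chosen for illustration).
grid = [0.5 + 0.001 * i for i in range(9501)]
e_g = [lcao_energy(x, +1) for x in grid]
e_u = [lcao_energy(x, -1) for x in grid]

# Position and depth of the gerade minimum.
i_min = min(range(len(grid)), key=lambda i: e_g[i])
r0, e_min = grid[i_min], e_g[i_min]

print(f"gerade minimum: R0/a0 = {r0:.2f}, E^g - E_1s = {e_min:.2f} eV")
print(f"ungerade curve positive everywhere on the grid: {min(e_u) > 0}")
```

The minimum found this way should lie close to the $R_0/a_0 \approx 2.5$ and $-1.77$ eV quoted in the text.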



[Figure: $E^g - E_{1s}$ and $E^u - E_{1s}$ in eV (vertical axis, -3.0 to 4.0) plotted against $R/a_0$ (horizontal axis, 1.0 to 6.0).]



7.3 Rotational and Vibrational Modes 

We can now study the effective one-body Schrodinger equation for the nuclear motion by setting
$E_j(R) = E^g(R)$ for the ground state. Because $E^g(R)$ only depends on the magnitude of $\mathbf{R}$ it
represents an effective central potential, so the solutions are of the form

$$F^g(\mathbf{R}) = \frac{1}{R}\, \mathcal{R}_{NL}(R)\, Y_{L M_L}(\theta, \phi)$$

where $Y_{L M_L}(\theta, \phi)$ are the spherical harmonics and the function $\mathcal{R}_{NL}(R)$ satisfies the radial equation

$$\left[ -\frac{\hbar^2}{2\mu_{12}} \left( \frac{d^2}{dR^2} - \frac{L(L+1)}{R^2} \right) + E^g(R) - E \right] \mathcal{R}_{NL}(R) = 0$$

We can approximate the centrifugal barrier term by setting it equal to its value at $R = R_0$, writing

$$E_r = \frac{\hbar^2}{2\mu_{12} R_0^2}\, L(L+1)$$

In this approximation we are treating the molecule as a rigid rotator. We can also approximate
$E^g(R)$ by Taylor expanding about $R = R_0$. Because this point is a minimum, the first derivative
is zero:

$$E^g(R) \approx E^g(R_0) + \frac{1}{2} k (R - R_0)^2 + \cdots$$

where k is the value of the second derivative of $E^g$ at $R = R_0$.
With these two approximations, the radial equation becomes

$$\left[ -\frac{\hbar^2}{2\mu_{12}} \frac{d^2}{dR^2} + \frac{1}{2} k (R - R_0)^2 - E_N \right] \mathcal{R}_{NL}(R) = 0$$

where

$$E_N = E - E^g(R_0) - E_r$$

This is the equation for a simple harmonic oscillator with energies

$$E_N = \hbar\omega_0 \left(N + \tfrac{1}{2}\right), \qquad N = 0, 1, 2, \cdots$$

where $\omega_0 = \sqrt{k/\mu_{12}}$. The vibrational energies are of the order of a few tenths of an eV, whereas
the rotational energies are of the order of $10^{-3}$ eV. Both are much smaller than the spacing of the
electronic levels. Transitions between these various levels give rise to molecular spectra. The pure
rotational spectrum consists of closely-spaced lines in the infrared or microwave range. Transitions
which also involve changes to the vibrational state give rise to vibrational-rotational band spectra.
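
The orders of magnitude quoted for the vibrational and rotational energies can be estimated from the LCAO curve itself. The sketch below is illustrative only: it works in atomic units, assumes a proton mass of 1836.15 electron masses and a Hartree of 27.2114 eV, and obtains k by a central finite difference at the numerically located minimum.

```python
import math

HARTREE_EV = 27.2114        # assumed Hartree energy in eV
PROTON_MASS = 1836.15       # assumed proton mass in electron masses

def e_gerade(x):
    """LCAO E^g(R) - E_1s in Hartree, with x = R/a0."""
    overlap = (1.0 + x + x * x / 3.0) * math.exp(-x)
    numerator = ((1.0 + x) * math.exp(-2.0 * x)
                 + (1.0 - 2.0 * x * x / 3.0) * math.exp(-x))
    return numerator / (x * (1.0 + overlap))

# Locate the minimum R0 on a fine grid between 1 and 6 Bohr radii.
grid = [1.0 + 0.001 * i for i in range(5001)]
r0 = min(grid, key=e_gerade)

# Force constant k = d^2 E^g / dR^2 at R0 by central differences.
h = 0.01
k = (e_gerade(r0 + h) - 2.0 * e_gerade(r0) + e_gerade(r0 - h)) / h**2

mu12 = PROTON_MASS / 2.0                        # reduced mass of two protons
hbar_omega = math.sqrt(k / mu12) * HARTREE_EV   # vibrational quantum in eV
rot_const = HARTREE_EV / (2.0 * mu12 * r0**2)   # hbar^2 / (2 mu12 R0^2) in eV

print(f"R0/a0 = {r0:.2f}")
print(f"hbar*omega0 = {hbar_omega:.3f} eV")
print(f"E_r = {rot_const:.2e} eV x L(L+1)")
```

Within this crude model the vibrational quantum comes out at a few tenths of an eV and the rotational constant at a few times $10^{-3}$ eV, consistent with the scales stated above.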

7.4 Electronic states of the H$_2$ Molecule

Electrons are fermions with spin 1/2, so the gerade state can be doubly occupied, as can the
ungerade state (four states in all, the same as two 1s orbitals for each ion). The second electron changes
the structure of the wavefunction. Staying within LCAO, and ignoring spin, we can label basis
states as, e.g., $u^1_{1s}(\mathbf{r}_2)$, indicating the first electron on the second atom. The electrons are
indistinguishable, so the total wavefunctions (spin times spatial) must be eigenstates of parity and the
exchange operator $P_{12}$ which switches the electron labels, e.g. $P_{12}\, u^1_{1s}(\mathbf{r}_2) = u^2_{1s}(\mathbf{r}_2)$. They are
fermions, hence antisymmetric: $P_{12} = -1$.

Assuming both electrons are 1s and in the bonding g state, and ignoring their interaction, the
LCAO $1s^2$ spatial wavefunction is

$$\psi(\mathbf{r}_1, \mathbf{r}_2) = [u^1_{100}(\mathbf{r}_1) + u^1_{100}(\mathbf{r}_2)]\, [u^2_{100}(\mathbf{r}_1) + u^2_{100}(\mathbf{r}_2)]$$

This must be combined with a spin eigenfunction $\uparrow\uparrow$, $\downarrow\downarrow$, $(\uparrow\downarrow + \downarrow\uparrow)$ or $(\uparrow\downarrow - \downarrow\uparrow)$, where the first
arrow represents the spin state ($m_s = \pm\frac{1}{2}$) of the first electron. Since the spatial wavefunction
is symmetric under label exchange, it must in fact be combined with the antisymmetric spin
wavefunction $\uparrow\downarrow - \downarrow\uparrow$ to give the overall wavefunction in spin and space.

$$\psi(\mathbf{r}_1, \mathbf{r}_2, s_1, s_2) = [u^1_{100}(\mathbf{r}_1) + u^1_{100}(\mathbf{r}_2)]\, [u^2_{100}(\mathbf{r}_1) + u^2_{100}(\mathbf{r}_2)]\, [\uparrow\downarrow - \downarrow\uparrow]$$

This wavefunction describes two electrons, and is non-degenerate.

The second electron also adds an electron-electron repulsion to the Hamiltonian, which can be
treated by perturbation theory.

$$\Delta E = \langle \psi(\mathbf{r}_1, \mathbf{r}_2)|\, e^2/4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|\, |\psi(\mathbf{r}_1, \mathbf{r}_2)\rangle$$

There is a lot of subtlety here, since the electrons don't interact with themselves, only with each 
other, and we must avoid double-counting the interaction of 1-2 and 2-1. We'll return to this in 
more detail later in the context of Helium. 



8 The Variational Principle 



8.1 Approximate solution of the Schroedinger equation 

If we can't find an analytic solution to the Schroedinger equation, a trick known as the
variational principle allows us to estimate the energy of the ground state of a system. We choose
an unnormalized trial function $\Phi(a_n)$ which depends on some variational parameters, $a_n$, and
minimise

$$E[a_n] = \frac{\langle \Phi|H|\Phi \rangle}{\langle \Phi|\Phi \rangle}$$

with respect to those parameters. This gives an approximation to the wavefunction whose accuracy
depends on the number of parameters and the clever choice of $\Phi(a_n)$. For more rigorous treatments,
a set of basis functions with expansion coefficients $a_n$ may be used.

The proof is as follows. If we expand the normalised wavefunction

$$|\phi(a_n)\rangle = \Phi(a_n)/\langle \Phi(a_n)|\Phi(a_n) \rangle^{1/2}$$

in terms of the true (unknown) eigenbasis $|i\rangle$ of the Hamiltonian, then its energy is

$$E[a_n] = \sum_{ij} \langle \phi|i \rangle \langle i|H|j \rangle \langle j|\phi \rangle = \sum_i |\langle \phi|i \rangle|^2 E_i = E_0 + \sum_i |\langle \phi|i \rangle|^2 (E_i - E_0) \ge E_0$$

where the true (unknown) ground state of the system is defined by $H|i_0\rangle = E_0|i_0\rangle$. The inequality
arises because both $|\langle \phi|i \rangle|^2$ and $(E_i - E_0)$ must be non-negative.

Thus the lower we can make the energy $E[a_n]$, the closer it will be to the actual ground state
energy, and the closer $|\phi\rangle$ will be to $|i_0\rangle$.

If the trial wavefunction consists of a complete basis set of orthonormal functions $|\chi_i\rangle$, each
multiplied by $a_i$: $|\phi\rangle = \sum_i a_i|\chi_i\rangle$, then the solution is exact and we just have the usual trick of
expanding a wavefunction in a basis set. Alternatively, we might just use an incomplete set with a
few low-energy basis functions to get a $|\Phi\rangle$ close to the ground state $|i_0\rangle$. In practice, this is how
most quantum mechanics problems are solved.
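
The inequality $E[a_n] \ge E_0$ can be seen at work in the smallest possible example. The sketch below is purely illustrative: the 2x2 Hamiltonian is a made-up matrix in arbitrary units, and the trial state is the normalised vector $(\cos\theta, \sin\theta)$ with $\theta$ as the single variational parameter.

```python
import math

# Hypothetical 2x2 real symmetric Hamiltonian (arbitrary energy units).
H = [[0.0, -1.0],
     [-1.0, 1.5]]

def energy(theta):
    """<phi|H|phi> for the normalised trial vector (cos theta, sin theta)."""
    c, s = math.cos(theta), math.sin(theta)
    return H[0][0] * c * c + 2.0 * H[0][1] * c * s + H[1][1] * s * s

# Exact lowest eigenvalue of a 2x2 symmetric matrix, for comparison.
mean = 0.5 * (H[0][0] + H[1][1])
half_gap = math.hypot(0.5 * (H[0][0] - H[1][1]), H[0][1])
e0_exact = mean - half_gap

# Scan the variational parameter theta over [0, pi).
thetas = [math.pi * i / 2000 for i in range(2000)]
energies = [energy(t) for t in thetas]
e_best = min(energies)

print(f"exact E0 = {e0_exact:.6f}, best variational E = {e_best:.6f}")
```

Every trial energy lies at or above the exact ground-state energy, and the best one approaches it as the scan in $\theta$ is refined.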



8.2 Excited States 



The variational method can be adapted to give bounds on the energies of excited states, under
certain conditions. Suppose we choose a trial function $\Phi_1(\beta_n)$ with variational parameters $\beta_n$,
which is made orthogonal to the ground state $\phi_0$ by imposing the condition $\langle \phi_0|\phi_1 \rangle = 0$.

If we know $|\phi_0\rangle = |i_0\rangle$, then similar to the above

$$E[\beta_n] = \frac{\langle \Phi_1|H|\Phi_1 \rangle}{\langle \Phi_1|\Phi_1 \rangle} = \sum_{ij} \langle \phi_1|i \rangle \langle i|H|j \rangle \langle j|\phi_1 \rangle = \sum_i |\langle \phi_1|i \rangle|^2 E_i = 0 + E_1 + \sum_{i=2} |\langle \phi_1|i \rangle|^2 (E_i - E_1) \ge E_1$$

So the variational method gives an upper bound on the first excited-state energy, and so on. We
can satisfy $\langle i_0|\phi_1 \rangle = 0$ if $|i_0\rangle$ is known, or if it has a known symmetry which we can exploit
(e.g. if $|i_0\rangle$ has even parity, choosing $|\Phi_1\rangle$ to be odd).

In general, though, we only have a variational estimate of the ground state $\phi_0(a_n)$. In this case the
expression above, subject to the constraint $\langle \phi_1(\beta_n)|\phi_0(a_n) \rangle = 0$, gives an estimate of $E_1$. However,
the error in this approach will be larger than for $E_0$ because not only is the wavefunction incorrect,
but also the constraint $\langle \phi_1|\phi_0 \rangle = 0$ is not quite correct; using an approximate ground state does
not guarantee that we get an upper bound for the excited states.

If the excited state has different symmetry from those of the lower-lying levels, and we choose trial 
functions with the correct symmetries, orthogonality is guaranteed and we get an upper bound to 
the energy of the lowest-lying level with those symmetries, which is the excited state. 
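
As a worked illustration of the symmetry argument (a sketch only; the one-dimensional harmonic oscillator $V = \frac{1}{2}m\omega^2x^2$ and the trial function are choices made here, not taken from the notes): the ground state is even, so any odd trial function is automatically orthogonal to it. Taking $\Phi_1 = x\,e^{-\beta x^2}$ with a single parameter $\beta$ gives

$$\langle T \rangle = \frac{3\hbar^2\beta}{2m}, \qquad \langle V \rangle = \frac{3m\omega^2}{8\beta}, \qquad E[\beta] = \frac{3\hbar^2\beta}{2m} + \frac{3m\omega^2}{8\beta}$$

$$\frac{dE}{d\beta} = 0 \;\Rightarrow\; \beta = \frac{m\omega}{2\hbar}, \qquad E = \tfrac{3}{2}\hbar\omega$$

In this special case the trial family happens to contain the true first excited state, so the upper bound is reached exactly; a poorer odd trial function would give an energy above $\frac{3}{2}\hbar\omega$.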



8.3 Analytic example of variational method - Binding of the deuteron 



Say we want to solve the problem of a particle in a potential $V(r) = -Ae^{-r/a}$. This is a model for
the binding energy of a deuteron due to the strong nuclear force, with A = 32 MeV and a = 2.2 fm.
The strong nuclear force does not exactly have the form $V(r) = -Ae^{-r/a}$; unlike the Coulomb
interaction we don't know what the exact form should be, but $V(r) = -Ae^{-r/a}$ is a reasonable
model.

The potential is spherically symmetric, most attractive at r = 0 and falls rapidly to zero at large r,
so we choose a trial wavefunction which does the same, say $\phi = ce^{-\alpha r/2a}$. This has only one
dimensionless variational parameter, $\alpha$. The value of c follows from normalisation $\int c^2 e^{-\alpha r/a}\, 4\pi r^2\, dr = 1$,
which gives $c^2 = \alpha^3/8\pi a^3$. (The $4\pi r^2$ comes from the problem being three dimensional.)

According to the variational principle, our best estimate for the ground state using this trial
function comes from minimising $\langle \phi|H|\phi \rangle$ with respect to $\alpha$.

$$\langle \phi|H|\phi \rangle / \langle \phi|\phi \rangle = -\frac{\hbar^2}{2m} \int_0^\infty c^2 \left( e^{-\alpha r/2a}\, \nabla^2 e^{-\alpha r/2a} \right) 4\pi r^2\, dr - A \int_0^\infty c^2 \exp[-(\alpha + 1)r/a]\, 4\pi r^2\, dr$$

$$= \frac{\hbar^2 \alpha^2}{8ma^2} - A \left( \frac{\alpha}{\alpha + 1} \right)^3$$

From this we find the minimum for $E(\alpha)$ at $\alpha_0$:

$$\frac{dE}{d\alpha} = \frac{\hbar^2 \alpha}{4ma^2} - 3A \left( \frac{\alpha^2}{(\alpha + 1)^4} \right) = 0 \;\Rightarrow\; \frac{(\alpha_0 + 1)^4}{\alpha_0} = 12Ama^2/\hbar^2$$

Solving for $\alpha$ gives $\alpha_0 = 1.34$, and substituting back into $\langle \phi|H|\phi \rangle$ gives E = -2.14 MeV.

This is fairly close to the exact solution for this potential, which can be obtained analytically as
a Bessel function of $\sqrt{8mA}\,(a/\hbar)\,e^{-r/2a}$ if you manage to spot that change of variables! The exact
solution gives $E_0 = -2.245$ MeV.
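
The minimisation of $E(\alpha)$ can also be done numerically. The sketch below is hedged on two assumptions that the notes do not state explicitly: m is taken to be the reduced mass of the proton-neutron pair ($mc^2 \approx 469.46$ MeV) and $\hbar c = 197.327$ MeV fm. The helper `golden_minimum` is a hypothetical name for a standard golden-section search.

```python
import math

HBARC = 197.327    # assumed hbar*c in MeV fm
M_RED = 469.46     # assumed reduced nucleon mass m c^2 in MeV
A = 32.0           # potential depth in MeV (from the text)
a = 2.2            # potential range in fm (from the text)

T_SCALE = HBARC**2 / (8.0 * M_RED * a**2)   # hbar^2 / (8 m a^2) in MeV

def energy(alpha):
    """E(alpha) = hbar^2 alpha^2 / (8 m a^2) - A (alpha / (alpha + 1))^3 in MeV."""
    return T_SCALE * alpha**2 - A * (alpha / (alpha + 1.0))**3

def golden_minimum(f, lo, hi, tol=1e-10):
    """Golden-section search for the minimum of a unimodal f on [lo, hi]."""
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    x1, x2 = hi - ratio * (hi - lo), lo + ratio * (hi - lo)
    while hi - lo > tol:
        if f(x1) < f(x2):
            hi, x2 = x2, x1                  # minimum lies in [lo, old x2]
            x1 = hi - ratio * (hi - lo)
        else:
            lo, x1 = x1, x2                  # minimum lies in [old x1, hi]
            x2 = lo + ratio * (hi - lo)
    return 0.5 * (lo + hi)

alpha0 = golden_minimum(energy, 0.5, 3.0)
e_min = energy(alpha0)
# Both sides of the stationarity condition (alpha0 + 1)^4 / alpha0 = 12 A m a^2 / hbar^2.
lhs = (alpha0 + 1.0)**4 / alpha0
rhs = 12.0 * A * M_RED * a**2 / HBARC**2

print(f"alpha0 = {alpha0:.3f}, E = {e_min:.2f} MeV")
print(f"stationarity check: {lhs:.3f} vs {rhs:.3f}")
```

With these assumed constants the optimum lands near $\alpha_0 = 1.34$ and the energy within a few percent of the value quoted above; the small residual difference depends on the mass and $\hbar c$ values used.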



8.4 Quantum forces: the Hellmann-Feynman Theorem

For many systems one is often interested in forces as well as energies. If we can write the energy
of a system in state $|\phi\rangle$ as $E = \langle \phi|H|\phi \rangle$ and differentiate with respect to some quantity $\alpha$ then

$$\frac{dE}{d\alpha} = \left\langle \frac{d\phi}{d\alpha} \right| H |\phi\rangle + \langle \phi| \frac{dH}{d\alpha} |\phi\rangle + \langle \phi| H \left| \frac{d\phi}{d\alpha} \right\rangle$$

But since $H|\phi\rangle = E|\phi\rangle$ and $\langle \phi|\phi \rangle$ is 1 for normalisation:

$$\frac{dE}{d\alpha} = \langle \phi| \frac{dH}{d\alpha} |\phi\rangle + E \frac{d}{d\alpha} \langle \phi|\phi \rangle = \langle \phi| \frac{dH}{d\alpha} |\phi\rangle$$

This result is called the Hellmann-Feynman theorem: the first differential of the expectation value
of the Hamiltonian with respect to any quantity does not involve differentials of the wavefunction.

E.g. if $\alpha$ represents the position of a nucleus in a solid, then the force on that nucleus is the
expectation value of the force operator $-dH/d\alpha$. The theorem can be applied to any quantity which is a differential
of the Hamiltonian provided the basis set does not change.

Caveat: if we use an incomplete basis set which depends explicitly on the positions of the atoms,
then we have $|\phi\rangle = \sum_{n,i} |u_{n,i}(\mathbf{r})\rangle$. This gives spurious so-called "Pulay" forces if $|\phi\rangle$ is not an exact
eigenstate.
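
The theorem is easy to check numerically on a small model. The sketch below is illustrative only: the two-level Hamiltonian $H(\lambda) = \begin{pmatrix} \lambda & 1 \\ 1 & -\lambda \end{pmatrix}$ is an invented example, chosen because its ground state is known in closed form. It compares $\langle \phi|dH/d\lambda|\phi \rangle$ with a finite-difference derivative of the ground-state energy.

```python
import math

def ground_state(lam):
    """Ground-state energy and normalised eigenvector of H = [[lam, 1], [1, -lam]]."""
    energy = -math.sqrt(lam * lam + 1.0)
    v1, v2 = 1.0, energy - lam          # solves (lam - E) v1 + v2 = 0
    norm = math.hypot(v1, v2)
    return energy, (v1 / norm, v2 / norm)

lam = 0.7
energy, (c1, c2) = ground_state(lam)

# Hellmann-Feynman: dE/dlam = <phi| dH/dlam |phi>, with dH/dlam = diag(1, -1).
hf_derivative = c1 * c1 - c2 * c2

# Independent check: central finite difference of the eigenvalue itself.
h = 1e-5
fd_derivative = (ground_state(lam + h)[0] - ground_state(lam - h)[0]) / (2.0 * h)

print(f"Hellmann-Feynman: {hf_derivative:.8f}")
print(f"finite difference: {fd_derivative:.8f}")
```

The two numbers agree to within the finite-difference error, without ever differentiating the eigenvector.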



8.5 An aside about Kinetic Energy 



The expectation value of the kinetic energy $\langle T \rangle$ is always positive. This can be shown by an
integration by parts in which the first term vanishes provided the wavefunction tends to zero at
infinity (which it will for a bound state). In 1D:

$$\langle T \rangle = -\frac{\hbar^2}{2m} \int \Phi^* \frac{d^2}{dx^2} \Phi\, dx = -\frac{\hbar^2}{2m} \left[ \Phi^* \frac{d\Phi}{dx} \right]_{-\infty}^{\infty} + \frac{\hbar^2}{2m} \int \frac{d\Phi^*}{dx} \frac{d\Phi}{dx}\, dx = \frac{\hbar^2}{2m} \int \left| \frac{d\Phi}{dx} \right|^2 dx$$

The second term integrand is positive everywhere, so the kinetic energy is always positive.

8.6 Variational Method in MAPLE 

The variational method is exceptionally well suited to computer algebra packages such as MAPLE.
The procedure is as follows:

• Define trial wavefunction $\Phi$

• Evaluate normalisation factor $|c^2| = \langle \Phi|\Phi \rangle$

• Evaluate unnormalised kinetic energy $\langle T \rangle = \langle \Phi|T|\Phi \rangle$

• Evaluate unnormalised potential energy $\langle V \rangle = \langle \Phi|V|\Phi \rangle$

• Differentiate with respect to variational parameters $D_{a_n} = \frac{\partial}{\partial a_n} (\langle T \rangle + \langle V \rangle)/c^2$

• Solve $D_{a_n} = 0$ for all $a_n$

• Substitute optimal value for $a_n$ into $\Phi$

• Evaluate $[\langle T \rangle + \langle V \rangle]/c^2$ using optimised wavefunction

If one needs to do another variational calculation for a different potential and trial wavefunction,
only the definitions of the trial wavefunction and the potential need to be changed.
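
The same sequence of steps can be followed numerically. The sketch below uses Python rather than MAPLE and is illustrative only: the system (a hydrogen atom in atomic units with a Gaussian trial function $e^{-\alpha r^2}$), the trapezoidal quadrature and the ternary search that replaces the symbolic differentiation are all choices made here, not part of the notes.

```python
import math

def integrate(f, r_max=30.0, n=3000):
    """Trapezoidal estimate of the integral of f(r) * 4 pi r^2 over [0, r_max]."""
    h = r_max / n
    total = 0.0
    for i in range(1, n):               # end-point terms are zero or negligible
        r = i * h
        total += f(r) * 4.0 * math.pi * r * r
    return total * h

def energy(alpha):
    """Variational energy (Hartree) of the trial function exp(-alpha r^2)."""
    phi = lambda r: math.exp(-alpha * r * r)                 # step 1: trial function
    dphi = lambda r: -2.0 * alpha * r * phi(r)
    norm = integrate(lambda r: phi(r) ** 2)                  # step 2: <Phi|Phi>
    kinetic = integrate(lambda r: 0.5 * dphi(r) ** 2)        # step 3: <T> after integrating by parts
    potential = integrate(lambda r: -phi(r) ** 2 / r)        # step 4: <V> for V = -1/r
    return (kinetic + potential) / norm

# Steps 5-6: minimise over alpha with a ternary search instead of solving dE/dalpha = 0.
lo, hi = 0.05, 2.0
while hi - lo > 1e-6:
    m1, m2 = lo + (hi - lo) / 3.0, hi - (hi - lo) / 3.0
    if energy(m1) < energy(m2):
        hi = m2
    else:
        lo = m1

alpha_opt = 0.5 * (lo + hi)              # step 7: optimal parameter
e_opt = energy(alpha_opt)                # step 8: energy of the optimised function
print(f"alpha = {alpha_opt:.4f}, E = {e_opt:.4f} Hartree")
```

For this particular trial function the result can be compared with the known closed form $\alpha = 8/9\pi$, $E = -4/3\pi$ Hartree, which lies above the exact $-0.5$ Hartree as the variational principle requires. Changing the problem only requires editing `phi` (with `dphi`) and the potential inside `energy`.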

8.7 Density functional theory (Nobel prize 1998) 

If we consider the total probability density of a system of many interacting particles $\rho(\mathbf{r})$, there
may be several possible wavefunctions which could give rise to it: call this set $S(\Phi)$.

Now, consider the expectation value of the energy $\langle H \rangle$. We know from the variational principle
that $\langle H \rangle \ge E_0$. If we define a functional $F[\rho(\mathbf{r})] = \mathrm{Min}_{S(\Phi)} \langle H \rangle$, then it follows that $F[\rho] \ge E_0$.

Consequently we can use the variational principle to find the $\rho(\mathbf{r})$ which minimises the value of F,
and this may give us the ground state energy without having to evaluate the wavefunction. This
is especially useful when the wavefunction consists of complex combinations of many different
single-particle wavefunctions, as with the many electrons in a solid or molecule.

The drawback is that for interacting electrons, the functional is not known.



8.8 Kohn-Sham functional 



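For solids, we have $10^{26}$ electron states. Analytic solution becomes impossible. In the past 20
years the density functional theory has come to dominate condensed matter physics, extending to
chemistry, materials, minerals and beyond.

A popular form of DFT functional was introduced by Nobel laureate Walter Kohn and Lu Sham:

$$F[\rho] = T[\rho] + \frac{1}{2} \int\!\!\int \frac{e^2 \rho(\mathbf{r}) \rho(\mathbf{r}')}{4\pi\epsilon_0 |\mathbf{r} - \mathbf{r}'|}\, d^3r\, d^3r' + E_{xc}[\rho] + \sum_{ion} \int \frac{-Z_{ion} e^2 \rho(\mathbf{r}')}{4\pi\epsilon_0 |\mathbf{R}_{ion} - \mathbf{r}'|}\, d^3r'$$

Nobody has found a satisfactory functional for T. What is generally used is:

$$T = -\frac{\hbar^2}{2m} \sum_i \int \phi_i^*\, \nabla^2 \phi_i\, d^3r$$

which is the kinetic energy of non-interacting "quasiparticles" and depends explicitly on the
wavefunctions. The integrals represent electrostatic interactions between the electrons and between
electrons and ions, and $E_{xc}$ is 'everything else'. The advantage of this form is that it can be
recast to give a set of one-particle equations with non-interacting fermions moving in an effective
potential:

$$V_{eff}(\mathbf{r}) = \sum_{ion} \frac{-Z_{ion} e^2}{4\pi\epsilon_0 |\mathbf{R}_{ion} - \mathbf{r}|} + \int \frac{e^2 \rho(\mathbf{r}')}{4\pi\epsilon_0 |\mathbf{r} - \mathbf{r}'|}\, d^3r' + \frac{\delta E_{xc}}{\delta \rho(\mathbf{r})}$$

Since $V_{eff}$ depends on $\rho(\mathbf{r})$ these equations must be solved self-consistently.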

Thus the density functional theorem shows that the problem of solving the Schroedinger equation
for a collection of interacting electrons can be transformed to that of a system of non-interacting
'quasiparticles', with the cost that the Hamiltonian depends on the electron density $\rho(\mathbf{r})$:

$$H[\rho(\mathbf{r})]\, \phi_i = E_i \phi_i \quad \text{where} \quad \rho(\mathbf{r}) = \sum_i |\phi_i(\mathbf{r})|^2$$

Thus the Schroedinger equation is a nonlinear differential equation of many variables, and we
must turn to the variational method. The most general approach here is to use a Fourier series
(plane wave basis set). The wavefunction for the ith electron is then written as

$$\phi_i = \sum_{\mathbf{k}} c_{i\mathbf{k}} \exp(-i\mathbf{k}\cdot\mathbf{r}) \quad \text{and the variational equation becomes:} \quad E = \mathrm{Min} \sum_i \langle \phi_i|H(\rho)|\phi_i \rangle$$

The accuracy of the ground state energy of the electrons is determined by the number of Fourier
components used. The wavefunctions are expanded in a computer-friendly basis set and the
variational principle is used to transform the problem from a set of coupled non-linear differential
equations into a minimisation of a single function of many variables. Most structural properties
of materials depend only on the electron ground state.

The single particle eigenstates of the Kohn-Sham functional are not proper single electron states:
indistinguishability means there is no such thing. Nevertheless, they are Bloch states, and they
do exhibit well defined symmetry and energy "band-structure" which can help with interpretation
of the electronic structure.






9 Indistinguishable Particles and Exchange 



Quantum mechanics allows us to predict the results of experiments. If we conduct an experiment
with indistinguishable particles a correct quantum description cannot allow anything which
distinguishes between them. For example, if the wavefunctions of two particles overlap, and we detect
a particle, which one is it? The answer to this is not only that we don't know, but that we can't
know. Quantum mechanics can only tell us the probability of finding a particle in a given region.
The wavefunction must therefore describe both particles. The Schroedinger equation is then:

$$\left[ -\frac{\hbar^2}{2m} \nabla_1^2 - \frac{\hbar^2}{2m} \nabla_2^2 + V(\mathbf{r}_1) + V(\mathbf{r}_2) \right] \Phi(\mathbf{r}_1, \mathbf{r}_2) = E\, \Phi(\mathbf{r}_1, \mathbf{r}_2)$$

where the subscripts label each particle, and there are six coordinates, three for each particle. $\Phi$
is a wave in six dimensions which contains the information we can measure: the probability of
finding particles at $\mathbf{r}_1$ and $\mathbf{r}_2$, but not what we can't measure: which particle is which.

What basis states would be appropriate for $\Phi$? An approximation is to use a product such as
$\Phi(\mathbf{r}_1, \mathbf{r}_2) = |a(\mathbf{r}_1) b(\mathbf{r}_2)\rangle$ where $a(\mathbf{r}_1)$ and $b(\mathbf{r}_2)$ are one-particle wavefunctions of atoms 1 and 2.
This allows us to separate the two particle equation into two one particle equations:

$$\left[ -\frac{\hbar^2}{2m} \nabla_1^2 + V(\mathbf{r}_1) \right] |a(\mathbf{r}_1)\rangle = E_1 |a(\mathbf{r}_1)\rangle; \qquad \left[ -\frac{\hbar^2}{2m} \nabla_2^2 + V(\mathbf{r}_2) \right] |b(\mathbf{r}_2)\rangle = E_2 |b(\mathbf{r}_2)\rangle$$

provided that the particles do not interact (n.b. $\nabla_1^2$ does not act on $b(\mathbf{r}_2)$).

Unfortunately, by doing this we have introduced unphysical labels to the indistinguishable particles.
And this is wrong: the effect of it is that the particles do not interfere with each other because
they are in different dimensions (six dimensional space - remember?). When we construct a
two-particle wavefunction out of two one-particle wavefunctions we must ensure that the probability
density (the measurable quantity $|\Phi|^2$) is independent of the artificial labels.



9.1 The exchange operator and Pauli's exclusion principle 

We introduce the exchange operator $P_{12}$: an operator which permutes the labels of the particles.
This is a rather strange operator, because it only changes the unphysical labels which we have
attached to the one-particle wavefunctions in order to make the maths easier. For a meaningful
solution we must have a wavefunction which has a probability density unchanged by $P_{12}$: it
must be symmetric or antisymmetric with respect to exchange: $|\Phi(\mathbf{r}_1, \mathbf{r}_2)\rangle = \pm|\Phi(\mathbf{r}_2, \mathbf{r}_1)\rangle$.

Physical solutions must be eigenfunctions of $P_{12}$ with eigenvalues +1 (bosons) or -1 (fermions).
Also, any physically meaningful Hamiltonian must commute with $P_{12}$, otherwise H and $P_{12}$ could
not have common eigenfunctions and the system could not remain in an eigenstate of exchange.

A simple product wavefunction $|a(\mathbf{r}_1) b(\mathbf{r}_2)\rangle$ does not satisfy this (unless a = b). A linear
combination of all permutations is required, for two particles:

$$|\Phi^-\rangle = |a(\mathbf{r}_1) b(\mathbf{r}_2) - a(\mathbf{r}_2) b(\mathbf{r}_1)\rangle / \sqrt{2}$$

$$|\Phi^+\rangle = C_{ab} |a(\mathbf{r}_1) b(\mathbf{r}_2) + a(\mathbf{r}_2) b(\mathbf{r}_1)\rangle + C_{aa} |a(\mathbf{r}_2) a(\mathbf{r}_1)\rangle + C_{bb} |b(\mathbf{r}_2) b(\mathbf{r}_1)\rangle$$

where the $C_{ab}$ terms are expansion and normalisation parameters. Note that the antisymmetric
combination cannot include terms where both particles are in the same state, but there are three
possibilities for the symmetric state. Although any normalised linear combination of the $C_{ab}$, $C_{bb}$ and $C_{aa}$
terms is possible, $C_{bb}$ and $C_{aa}$ correspond to different configurations and are usually set to zero.

Notice that if a = b, then $|\Phi^-\rangle = 0$. Thus there is no possible antisymmetric combination
involving identical states, i.e. two fermions cannot be in the same quantum state: the Pauli
exclusion principle.



9.2 Two indistinguishable particles with spin 1/2 

If we have two identical fermions of spin 1/2, confined in the same region, what is the appropriate
wavefunction? In the scattering case we could measure spins far from the interaction, and if we
knew that the total spin is conserved, spins can be associated with each particle. In the bound
state we cannot tell which particle we are measuring, so the ket must contain both spin and spatial
wavefunctions of both particles.

Assuming the spins do not interact, we can separate the two-particle spin wavefunction into
$\sigma(1,2) = \sigma_1 \sigma_2$. We also know the appropriate one particle basis states $\uparrow_1$, $\downarrow_1$, $\uparrow_2$, $\downarrow_2$, where $\downarrow_1$
represents "particle 1" in spinor state $\binom{0}{1}$. The combinations for indistinguishable particles are
then:

$$\uparrow_1\uparrow_2, \quad \downarrow_1\downarrow_2, \quad (\uparrow_1\downarrow_2 + \downarrow_1\uparrow_2)/\sqrt{2}, \quad (\uparrow_1\downarrow_2 - \downarrow_1\uparrow_2)/\sqrt{2}$$

Operating on these with $P_{12}$ yields eigenvalues 1, 1, 1 and -1 respectively. $S^2 = S(S+1)$ yields
2, 2, 2 and 0; $S_z$ yields 1, -1, 0 and 0. Thus the demands of indistinguishability couple the spins
of two identical particles into a triplet (S=1) and a singlet (S=0). The spin-1 vector has three
possible $M_S$ component values - hence the triplet.
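
These eigenvalues can be verified by writing the three operators as 4x4 matrices in the product basis. This is a minimal sketch: the basis ordering ($\uparrow\uparrow$, $\uparrow\downarrow$, $\downarrow\uparrow$, $\downarrow\downarrow$), the units ($\hbar = 1$) and the helper name `eigenvalue` are choices made here for illustration.

```python
import math

# Basis order: |uu>, |ud>, |du>, |dd>  (first arrow refers to particle 1).
P12 = [[1, 0, 0, 0], [0, 0, 1, 0], [0, 1, 0, 0], [0, 0, 0, 1]]   # swaps the labels
S2 = [[2, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 2]]    # total S^2 / hbar^2
SZ = [[1, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, -1]]   # total S_z / hbar

def eigenvalue(op, v):
    """Return lam if op v = lam v (to rounding error) for normalised v, else raise."""
    w = [sum(op[i][j] * v[j] for j in range(4)) for i in range(4)]
    lam = sum(a * b for a, b in zip(v, w))
    if any(abs(w[i] - lam * v[i]) > 1e-12 for i in range(4)):
        raise ValueError("not an eigenvector")
    return round(lam, 12) + 0.0          # "+ 0.0" turns -0.0 into 0.0

r = 1.0 / math.sqrt(2.0)
states = {
    "uu": [1, 0, 0, 0],
    "dd": [0, 0, 0, 1],
    "(ud+du)/sqrt2": [0, r, r, 0],
    "(ud-du)/sqrt2": [0, r, -r, 0],
}

# For each state: (P12 eigenvalue, S(S+1), S_z eigenvalue).
table = {name: (eigenvalue(P12, v), eigenvalue(S2, v), eigenvalue(SZ, v))
         for name, v in states.items()}
for name, values in table.items():
    print(name, values)
```

Note that the unsymmetrised products $\uparrow\downarrow$ and $\downarrow\uparrow$ on their own would make `eigenvalue` raise for `P12` and `S2`, since they are not eigenstates of exchange.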

9.3 The exchange interaction 

The overall wavefunction describing fermions must be antisymmetric with respect to exchange,
i.e. $P_{12}|\Phi\rangle = -|\Phi\rangle$. Therefore in an atom or molecule where $\Phi$ includes both spin and spatial
parts, the spin and spatial parts of a fermionic wavefunction have opposite exchange symmetry.

Spin must be considered even if the energy (Coulomb potential) depends explicitly only on the
spatial part. The expectation value of the potential energy is different for symmetric and
antisymmetric spatial combinations. Using $|\Phi^\pm\rangle$ from above (normalised, with $C_{ab} = 1/\sqrt{2}$):

$$\langle \Phi^\pm|V|\Phi^\pm \rangle = \langle a(\mathbf{r}_1) b(\mathbf{r}_2)|V(r)|a(\mathbf{r}_1) b(\mathbf{r}_2) \rangle \pm \langle a(\mathbf{r}_1) b(\mathbf{r}_2)|V(r)|a(\mathbf{r}_2) b(\mathbf{r}_1) \rangle$$

The first term is called the direct interaction and the second term is known as the exchange
interaction: a measurable contribution to the energy comparable in size to the first, which has no
classical analogue.

9.4 Spins and Exchange 

Now notice something strange. The exchange interaction has split the S=1 states from the S=0
states. We could write the potential as $V = J_{nl} - (2S - 1)K_{nl}$, even though the Hamiltonian
does not act on the spin! This is because the sign of the exchange integral depends on the
(anti)symmetry of the spatial wavefunction. Thus we can write the matrix element as

$$\langle \Phi|\, J_{nl} - (2S - 1)K_{nl}\, |\Phi \rangle$$

This 'exchange interaction' appears to depend on the spin - the triplet states have lower energy
than the singlet (this is one of Hund's rules for determining energy levels in atoms). It is this type
of exchange force which keeps spins aligned in a ferromagnet, not the magnetic interaction itself.



9.5 Wavefunction for many spin one-half particles 



The exchange arguments for two-particle systems can be extended to many particle systems: the
indistinguishable wavefunction consists of all possible permutations of the product of one electron
wavefunctions. For the symmetric case $P_{nm}\Phi = \Phi$ a sum of these permuted products will suffice.
For the antisymmetric case, the correct form turns out to be given by the determinant of a matrix:

$$\Phi = \det \begin{pmatrix} \phi_a(1) & \phi_b(1) & \cdots & \phi_n(1) \\ \phi_a(2) & \phi_b(2) & \cdots & \phi_n(2) \\ \vdots & \vdots & & \vdots \\ \phi_a(N) & \phi_b(N) & \cdots & \phi_n(N) \end{pmatrix}$$

This is called a Slater Determinant. For fermions, where $P_{nm}\Phi = -\Phi$, the Slater Determinant
obeys the Pauli exclusion principle: if any two of the one-particle wavefunctions were identical
($\phi_n = \phi_m$), then the wavefunction would be the determinant of a matrix with two identical columns,
i.e. zero.

Note also that the determinant has many more exchange terms than direct ones.
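
Both properties of the Slater determinant, the sign change under exchange and the vanishing for identical orbitals, can be checked directly from the permutation expansion. The sketch below is illustrative: the three one-dimensional orbitals are arbitrary unnormalised functions invented for the test, and the $1/\sqrt{N!}$ prefactor is the conventional normalisation for orthonormal orbitals, assumed here.

```python
import math
from itertools import permutations

def permutation_sign(perm):
    """Parity (+1 or -1) of a permutation, found by counting inversions."""
    sign = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                sign = -sign
    return sign

def slater(orbitals, coords):
    """Sum over permutations P of sign(P) * prod_k orbitals[P(k)](coords[k])."""
    n = len(orbitals)
    total = 0.0
    for perm in permutations(range(n)):
        term = permutation_sign(perm)
        for k in range(n):
            term *= orbitals[perm[k]](coords[k])
        total += term
    return total / math.sqrt(math.factorial(n))

# Hypothetical one-particle orbitals in 1D (unnormalised, for illustration only).
phi_a = lambda x: math.exp(-x * x)
phi_b = lambda x: x * math.exp(-x * x)
phi_c = lambda x: (2 * x * x - 1) * math.exp(-x * x)

x = (0.3, -0.8, 1.1)
psi = slater([phi_a, phi_b, phi_c], x)
psi_swapped = slater([phi_a, phi_b, phi_c], (x[1], x[0], x[2]))  # exchange particles 1 and 2
psi_pauli = slater([phi_a, phi_a, phi_c], x)                     # two identical orbitals

print(f"psi = {psi:.6f}, after exchange = {psi_swapped:.6f}")
print(f"with a repeated orbital: {psi_pauli:.2e}")
```

For N particles the expansion has N! terms, only one of which is the unpermuted 'direct' product, which is the counting behind the remark about exchange terms above.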



9.6 Helium 

Helium is the simplest system for which we are unable to accurately calculate the energy. 

For a single electron moving in the field of a helium nucleus, the spatial wavefunctions are similar
to those of hydrogen $|u_{nlm}\rangle$.

When a second electron is added, a reasonable basis set is exchange-symmetrised wavefunctions
consisting of spin states multiplying hydrogenic spatial parts:

$$u_{nlm}(\mathbf{r}_1)\, u_{n'l'm'}(\mathbf{r}_2) \pm u_{n'l'm'}(\mathbf{r}_1)\, u_{nlm}(\mathbf{r}_2)$$

Since the overall wavefunction must be antisymmetric, the singlet (exchange-antisymmetric) spin
states must combine with symmetric spatial states, and the triplet (exchange-symmetric) spin
states must combine with antisymmetric spatial states.

If both electrons were in the same spatial state, the antisymmetric spatial wavefunction would be:

$$|a(\mathbf{r}_1) a(\mathbf{r}_2) - a(\mathbf{r}_2) a(\mathbf{r}_1)\rangle = 0$$

Hence there is no triplet for the ground state.



9.7 Electron-electron interaction - ground state by perturbation theory 

The hydrogen wavefunctions are only a choice of basis set: the hydrogenic potential ignores the
electron-electron repulsion. A simple approach is to treat this as a perturbation and to use
degenerate perturbation theory.

The perturbing potential is just $V = e^2/4\pi\epsilon_0 r_{12}$ where $r_{12} = |\mathbf{r}_1 - \mathbf{r}_2|$. The unperturbed spatial
ground state is just a product of the hydrogenic ones with Z=2 for helium:

$$u_{100}(\mathbf{r}_1)\, u_{100}(\mathbf{r}_2) = \frac{Z^3}{\pi a_0^3}\, e^{-Z(r_1 + r_2)/a_0}$$

so by perturbation theory, the energy shift due to this potential is given by:

$$\Delta E = \langle u_{100}(\mathbf{r}_1) u_{100}(\mathbf{r}_2)|\, e^2/4\pi\epsilon_0 r_{12}\, |u_{100}(\mathbf{r}_1) u_{100}(\mathbf{r}_2) \rangle$$

The electron-electron repulsion is over 30% of the unperturbed energy ($Z^2\mu e^4/\hbar^2$), so perturbation
theory may seem inappropriate. Strictly, it isn't even the right integral, as it neglects correlation.
But in fact the value of this integral is $5Z\mu e^4/8\hbar^2$, which gives a total energy within about 5% of the actual energy.

Note also that the radial wavefunctions are different for 2s and 2p, so the electron-electron
interaction splits the degeneracy between 1s2s and 1s2p configurations.
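
The percentages quoted for the helium ground state follow from simple arithmetic. The sketch below is a hedged check: it takes $\mu e^4/\hbar^2$ (one Hartree) as 27.2114 eV and uses -79.0 eV as the measured ground-state energy of helium; both numbers are supplied here as reference values, not taken from the notes.

```python
HARTREE_EV = 27.2114          # assumed value of mu e^4 / hbar^2 in eV
Z = 2
E_EXPERIMENT_EV = -79.0       # assumed measured helium ground-state energy in eV

e_unperturbed = -Z**2 * HARTREE_EV       # two electrons, each with -Z^2/2 Hartree
delta_e = 5.0 * Z / 8.0 * HARTREE_EV     # first-order electron-electron repulsion
e_first_order = e_unperturbed + delta_e

repulsion_fraction = delta_e / abs(e_unperturbed)
relative_error = abs(e_first_order - E_EXPERIMENT_EV) / abs(E_EXPERIMENT_EV)

print(f"unperturbed energy : {e_unperturbed:8.2f} eV")
print(f"repulsion integral : {delta_e:8.2f} eV ({100 * repulsion_fraction:.0f}% of unperturbed)")
print(f"first-order total  : {e_first_order:8.2f} eV ({100 * relative_error:.1f}% from experiment)")
```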



9.8 Multiplicity and Degeneracy of Excited States 

Ignoring electron-electron interaction, all 1s2s and 1s2p states have the same energy. The
perturbation ($e^2/4\pi\epsilon_0 r_{12}$) lifts that degeneracy, and we can treat it with degenerate perturbation theory.
Rather than evaluating the integral in the 4x4 matrix exactly, we can use a physical argument:
($e^2/4\pi\epsilon_0 r_{12}$) is not an external potential, and so applies no net torque or force on the electrons.
The perturbation cannot change the angular momentum, so it cannot mix states with different l
or m. The theta integral will be $\delta_{ll'}$, and the phi integral $\delta_{mm'}$; total angular momentum remains a
good quantum number: L=0 (1s2s) or L=1 (1s2p). Since the 2s state has finite probability of being
at the nucleus, and the 2p has zero probability of being there, the 2s state is less well screened
from the nuclear charge by the 1s and will have lower energy.

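For a given spatial excited state the possible normalised spin wavefunction combinations,
consistent with the antisymmetry requirement, are a spin triplet and a spin singlet.

[Figure: energy-level diagram showing the (1s)(2s) and (1s)(2p) configurations split into the levels $^1P_1$, $^3P_{2,1,0}$, $^1S$ and $^3S$, above the $(1s)^2$ $^1S$ ground state.]

$$\Phi_3 = (\phi_{nlm,n'l'm'} - \phi_{n'l'm',nlm})\, (\uparrow\uparrow)/\sqrt{2}$$
$$\phantom{\Phi_3 =}\; (\phi_{nlm,n'l'm'} - \phi_{n'l'm',nlm})\, (\downarrow\downarrow)/\sqrt{2}$$
$$\phantom{\Phi_3 =}\; (\phi_{nlm,n'l'm'} - \phi_{n'l'm',nlm})\, (\uparrow\downarrow + \downarrow\uparrow)/2$$
$$\Phi_1 = (\phi_{nlm,n'l'm'} + \phi_{n'l'm',nlm})\, (\uparrow\downarrow - \downarrow\uparrow)/2$$

where $|\phi_{nlm,n'l'm'}\rangle$ represents electron 1 in a hydrogenic
state with quantum numbers n, l and m and electron 2
with n', l' and m'. The subscripts on the $\Phi$ label spin
multiplicity (2S+1).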

Again the whole effect of the potential is contained in the spatial part; the spin integral will be $\delta_{\sigma\sigma'}$,
so off-diagonal matrix elements are all zero. We need to evaluate

$J_{nl} = \langle \phi_{nlm,n'l'm'}|\, (e^2/4\pi\epsilon_0 r_{12})\, |\phi_{nlm,n'l'm'} \rangle$ - the direct integral.

$K_{nl} = \langle \phi_{nlm,n'l'm'}|\, (e^2/4\pi\epsilon_0 r_{12})\, |\phi_{n'l'm',nlm} \rangle$ - the exchange integral.

with which perturbation theory gives an energy shift in the $1s^1 2s^1$ state of:

$$\frac{1}{2} \frac{e^2}{4\pi\epsilon_0} \Big( \langle \phi_{100,200}|1/r_{12}|\phi_{100,200} \rangle + \langle \phi_{200,100}|1/r_{12}|\phi_{200,100} \rangle \pm \langle \phi_{100,200}|1/r_{12}|\phi_{200,100} \rangle \pm \langle \phi_{200,100}|1/r_{12}|\phi_{100,200} \rangle \Big)$$

where the + applies to the singlet state and the - to the triplet. The direct integral, electron-electron
repulsion, increases the energy, but the exchange integral can either increase or decrease
energy.

Thus the energy levels are split by different direct interactions into L=0 and L=1 and again
through exchange interaction into singlet and triplet. The final degeneracies of states with one
electron excited to n=2 are 3, 1, 9 and 3. The spectroscopic notation in the figure gives the quantum
numbers as: $(nl)(n'l')\; ^{2S+1}L_J$

Again, the most useful quantum number labels are the total spin and angular momentum: we could
write the perturbation energy as $\Delta E = J_{nl} - (2S - 1)K_{nl}$, even though the perturbing potential
does not act on the spin. The 'exchange force' selects the preferred spin state via the requirement of
overall antisymmetry.



10 Self-consistent field theory 



An important unsolved problem in quantum mechanics is how to deal with indistinguishable, in- 
teracting particles - in particular electrons which determine the behaviour of almost every object 
in nature. The basic problem is that if particles interact, that interaction must be in the Hamil- 
tonian. So until we know where the particles are, we can't write down the Hamiltonian, but until 
we know the Hamiltonian, we can't tell where the particles are. 



10.1 Hartree-Fock theory 

The idea is to solve the Schroedinger equation for an electron moving in the potential of the nucleus
and all the other electrons. We start with a guess for the trial electron charge density, solve Z/2
one-particle Schroedinger equations (initially identical) to obtain Z electron wavefunctions. Then
we construct the potential for each wavefunction from that of the nucleus and that of all the other
electrons, symmetrise it, and solve the Z/2 Schroedinger equations again.

Fock improved on Hartree's method by using the properly antisymmetrised wavefunction (Slater 
determinant) instead of simple one-electron wavefunctions. Without this, the exchange interac- 
tion is missing. This method is ideal for a computer, because it is easily written as an algorithm. 



[Flowchart: Guess wavefunction -> Calculate charge density -> Calculate potential -> Solve Schroedinger equation -> Calculate charge density -> Is charge density same as before? If yes, finish; if not, return to "Calculate potential".]

Figure 5: Algorithm for Self-consistent field theory. 
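
The loop in the figure can be sketched on a toy problem. The model below is hypothetical and chosen only because it is solvable in a few lines: one particle on two sites, with a mean-field on-site potential proportional to the density on that site (parameters `T_HOP`, `DELTA` and `G` are invented). It is not a Hartree-Fock calculation, but it follows the same guess, solve, compare and repeat structure.

```python
import math

T_HOP, DELTA, G = 1.0, 1.0, 1.0    # hypothetical hopping, site offset, mean-field coupling

def solve(rho1):
    """Ground state of H[rho] = [[G*rho1, -T_HOP], [-T_HOP, DELTA + G*(1 - rho1)]]."""
    a, b = G * rho1, DELTA + G * (1.0 - rho1)        # density-dependent potential
    energy = 0.5 * (a + b) - math.hypot(0.5 * (a - b), T_HOP)
    v1, v2 = T_HOP, a - energy                       # solves (a - E) v1 - T_HOP v2 = 0
    return energy, v1 * v1 / (v1 * v1 + v2 * v2)     # energy and new density on site 1

rho1 = 0.5                                # guess the charge density
for iteration in range(200):
    energy, rho1_new = solve(rho1)        # calculate potential, solve, recompute density
    if abs(rho1_new - rho1) < 1e-12:      # is the density the same as before?
        break
    rho1 = 0.5 * rho1 + 0.5 * rho1_new    # mix old and new densities for stability

print(f"converged after {iteration} iterations: rho1 = {rho1:.6f}, E = {energy:.6f}")
```

Mixing the old and new densities, rather than replacing one by the other outright, is a common stabilising choice in self-consistent loops; for this toy model plain replacement would also converge.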

Although we are concerned here with atoms, the same methodology is used for molecules or even 
solids (with appropriate potential symmetries and boundary conditions). This is a variational 
method, so wherever we refer to wavefunctions, we assume that they are expanded in some ap- 
propriate basis set. 

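The full set of equations is

$$E_i \phi_i(\mathbf{r}) = \left( -\frac{\hbar^2}{2m} \nabla^2 + V_{ion}(\mathbf{r}) \right) \phi_i(\mathbf{r}) + \sum_j \int d\mathbf{r}'\, \frac{e^2 |\phi_j(\mathbf{r}')|^2}{4\pi\epsilon_0 |\mathbf{r} - \mathbf{r}'|}\, \phi_i(\mathbf{r}) - \sum_j \delta_{\sigma_i \sigma_j} \int d\mathbf{r}'\, \frac{e^2 \phi_j^*(\mathbf{r}') \phi_i(\mathbf{r}')}{4\pi\epsilon_0 |\mathbf{r} - \mathbf{r}'|}\, \phi_j(\mathbf{r}) \qquad (3)$$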

The first term is the kinetic energy and electron-ion potential. The second "Hartree" term, is 
the electrostatic potential from the charge distribution of N electrons, including an unphysical 
self-interaction of electrons when j = i. The third, "exchange" term, acts only on electrons with 
the same spin and comes from the Slater determinant form of the wavefunction. 

Physically, the effect of exchange is for like-spin electrons to avoid each other. Each electron is 
surrounded by an "exchange hole" : there is one fewer like-spin electrons nearby than the mean-field 
would imply. The term i = j neatly cancels out the self interaction of the electron. 



10.2 Self-consistent fields 



Iterative, self-consistent approaches similar to the Hartree-Fock method can be used to calculate
properties of atoms, solids or molecules. All that changes is $V_{ion}$.

For non-central potentials appropriate boundary conditions are needed (e.g. periodic in the case 
of crystals). One of the main problems now is to select an appropriate basis set for the problem. 
Various options exist: Plane waves, atomic orbitals, 'augmented' plane waves which wiggle more 
near to the nuclei, gaussian or 'muffin tin' orbitals which are localised on the nuclei. There is still 
a huge amount of research going on in this area. 



10.3 Correlation: conditional probability 



Hartree-Fock theory does not properly describe correlation. In the Copenhagen Interpretation, 
the squared modulus of the wavefunction gives the probability of finding a particle in a given place. 
The many-body wavefunction gives the N-particle distribution function, i.e. $|\Phi(\mathbf{r}_1, \ldots, \mathbf{r}_N)|^2$ is the 
probability density that particle 1 is at $\mathbf{r}_1$, ..., and particle N is at $\mathbf{r}_N$. 

But when trying to work out the interaction between electrons, what we want to know is the 
probability of finding an electron at $\mathbf{r}$, given the positions of all the other electrons $\{\mathbf{r}_i\}$. This 
implies that the electron behaves quantum mechanically when we evaluate its wavefunction, but 
as a classical point particle when it contributes to the potential seen by the other electrons. 



10.4 Lattice methods: Variational method by computer 



The variational method transposes the problem of solving a differential equation onto the problem 
of minimising a function of many variables. It is therefore good for use with computers. 

One of the simplest ways of solving for the ground state of a system with a computer is to use 
a basis set consisting of the values of $|\phi\rangle$ defined on a lattice. In 1D such a solution is simply a 
histogram where we adjust the wavefunction at each point until the energy of the whole system 
is minimised. The kinetic energy (second derivative of the wavefunction) must then be obtained 
by some interpolation method. The weights of $|\phi\rangle$ at each point can be regarded as a basis set. It 
is not complete, but it becomes more and more complete as the lattice gets finer. 
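
As an illustration of "adjusting the histogram until the energy is minimised", here is a minimal sketch for a 1D harmonic oscillator. The choice of potential, the units ($\hbar = m = \omega = 1$), the three-point finite-difference kinetic energy and the steepest-descent update are all assumptions of this example rather than anything prescribed above; the function name and its parameters are hypothetical.

```python
import math


def lattice_ground_state(n_points=81, half_width=5.0, step=0.005, n_steps=4000):
    """Minimise <psi|H|psi>/<psi|psi> for H = -1/2 d^2/dx^2 + x^2/2 on a lattice.

    Units with hbar = m = omega = 1, so the exact continuum answer is 0.5.
    """
    dx = 2.0 * half_width / (n_points - 1)
    x = [-half_width + i * dx for i in range(n_points)]
    v = [0.5 * xi * xi for xi in x]
    psi = [1.0] * n_points            # crude initial guess: a flat histogram

    def apply_h(p):
        out = []
        for i in range(n_points):
            left = p[i - 1] if i > 0 else 0.0              # psi = 0 outside the box
            right = p[i + 1] if i < n_points - 1 else 0.0
            kinetic = -(left - 2.0 * p[i] + right) / (2.0 * dx * dx)
            out.append(kinetic + v[i] * p[i])
        return out

    energy = 0.0
    for _ in range(n_steps):
        h_psi = apply_h(psi)
        norm2 = sum(p * p for p in psi)
        energy = sum(p * hp for p, hp in zip(psi, h_psi)) / norm2
        # Steepest descent: move each lattice value against the energy gradient.
        psi = [p - step * (hp - energy * p) for p, hp in zip(psi, h_psi)]
        scale = 1.0 / math.sqrt(sum(p * p for p in psi) * dx)
        psi = [p * scale for p in psi]                     # renormalise
    return energy, x, psi
```

The returned energy lies slightly below 0.5 because of the finite lattice spacing; refining the lattice (with a correspondingly smaller `step`) moves it closer to the continuum value.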

Another common way of solving the Schroedinger equation numerically is to write the wavefunction 
as a Fourier series, 

$$\Phi(\mathbf{r}) = \sum_{\mathbf{b}} a_{\mathbf{b}}e^{i\mathbf{b}\cdot\mathbf{r}}$$

where $a_{\mathbf{b}}$ are the variational parameters. Using Fourier series (also called plane wave expansions) 
has several advantages. Increasing accuracy can be obtained by adding more Fourier components 
(because each plane wave is orthogonal to all the others), the value of $\Phi(\mathbf{r})$ can be quickly found 
by a Fourier transform of $a_{\mathbf{b}}$, and the kinetic energy has a particularly simple form because 

$$\frac{-\hbar^2}{2m}\frac{\int\Phi^*\nabla^2\Phi\,d^3r}{\int\Phi^*\Phi\,d^3r} = \frac{\hbar^2}{2m}\frac{\sum_{\mathbf{b}}|a_{\mathbf{b}}|^2b^2}{\sum_{\mathbf{b}}|a_{\mathbf{b}}|^2}$$

which requires no numerical differentiation if used on a computer. The wavefunctions must be 
normalised, e.g. with Periodic Boundary Conditions. For a periodically repeating crystal, these 
are exactly the correct boundary conditions anyway. In condensed matter physics plane waves 
contrast with using LCAO as basis functions. 
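
The statement that the plane-wave kinetic energy needs no numerical differentiation can be made concrete with a short sketch. It simply evaluates $(\hbar^2/2m)\sum_{\mathbf{b}}|a_{\mathbf{b}}|^2b^2/\sum_{\mathbf{b}}|a_{\mathbf{b}}|^2$; storing the coefficients in a dictionary keyed by wavevector, and the default units $\hbar = m = 1$, are choices made for this example only.

```python
def kinetic_energy(coeffs, hbar=1.0, mass=1.0):
    """Kinetic energy of a plane-wave expansion.

    coeffs maps a wavevector b (a tuple of components) to its complex
    coefficient a_b. No derivatives are taken: each plane wave simply
    contributes hbar^2 b^2 / 2m weighted by |a_b|^2.
    """
    weighted = sum(abs(a) ** 2 * sum(c * c for c in b) for b, a in coeffs.items())
    norm = sum(abs(a) ** 2 for a in coeffs.values())
    return hbar ** 2 / (2.0 * mass) * weighted / norm
```

For a single plane wave the result is just $\hbar^2b^2/2m$, and the ratio form means the coefficients need not be normalised beforehand.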



10.5 Pseudopotentials 

A drawback to using plane waves is that electron wavefunctions don't actually look much like 
plane waves, so the basis set is very different from the wavefunctions, and very many Fourier 
components are required. One way around this is to use a 'pseudopotential' which attempts to 
describe the potential due to the nucleus and tightly bound shells of 'core' electrons which do not 
take part in bonding. In silicon for example the pseudopotential describes the nucleus and the 
1s, 2s and 2p electrons. 

The pseudopotential can be deduced from properties of the perfect atom. Consider: 

$$V(r)\Phi(r) = \left(E + \frac{\hbar^2\nabla^2}{2m}\right)\Phi(r)$$

where we know the atomic properties $E$ and $\Phi(r)$, but not $V(r)$, the potential seen by the outer 
electrons. We can invert the Schroedinger problem, solving for $V(r)$ to give the exact $\Phi(r)$ outside 
some core radius, $r > r_c$, but smoothing it out for $r < r_c$. 

In most applications involving chemical binding, the wavefunction only changes in the region 
outside $r_c$. So although the pseudowavefunction is not the correct Kohn-Sham eigenfunction, 
changes in its energy due to interaction with other electrons and ions are the same as the change 
in the Kohn-Sham eigenfunction. 

Choosing $r_c$ and inverting the Schroedinger equation is non-unique, but in general: 

• Pseudopotentials depend on the $l$ quantum number, because they must include the fact that, e.g. 
3s must be radially orthogonal to 1s and 2s, while 3d are automatically so because of the angular 
dependence. This is called non-locality. 

• The core charge produced by the pseudo wavefunctions must be the same as that produced by the 
atomic wavefunctions. This ensures that the pseudo atom produces the same scattering properties 
as the ionic core. 

• Pseudo-electron eigenvalues must be the same as the valence eigenvalues obtained from the atomic 
wavefunctions. 

• Pseudo wavefunctions and their first and second derivatives must be continuous at the core radius, 
and the pseudo wavefunctions must be non-oscillatory. 

If you find it surprising that this works - it is! However tens of thousands of calculations give 
energies correct to within a few percent, so the approach seems to accord well with reality. 

10.6 k-point sampling 

DFT reduces the problem of $10^{26}$ interacting electrons to $10^{26}$ noninteracting quasiparticles. To 
reduce this to a manageable number, we recall that electrons in solids can be labelled by a 
wavevector k, and that they form bands in which electrons with similar k have similar energy. 
The energy is the integral of these, thus we can obtain a good estimate by sampling states from 
an evenly-spaced grid of "k-points". As this grid becomes finer, so the accuracy of the integral 
improves. For most systems a surprisingly small number suffices: tens for insulators and hundreds 
for metals. 
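
To see how an evenly-spaced grid of k-points converges, consider a sketch for a model that is not part of these notes: a half-filled 1D tight-binding band $E(k) = -2t\cos k$ (lattice spacing 1, Fermi level at $E = 0$), for which the exact band energy per site is $-4t/\pi$. The band, the function name and the parameter `n_k` are all assumptions of this example.

```python
import math


def band_energy_per_site(n_k, t=1.0):
    """Band energy per site from n_k evenly spaced k-points in (-pi, pi).

    Only states below the Fermi level (E < 0 at half filling) are occupied;
    the factor 2 accounts for spin degeneracy.
    """
    total = 0.0
    for i in range(n_k):
        k = -math.pi + (i + 0.5) * 2.0 * math.pi / n_k   # midpoint grid
        energy = -2.0 * t * math.cos(k)
        if energy < 0.0:
            total += 2.0 * energy
    return total / n_k
```

Because this model band is only partly filled (a metal), the sum is sensitive to how the grid straddles the Fermi points, which is one way to picture why metals need finer grids than insulators.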

According to the Bloch theorem, any wavefunction must be written: 

$$\Phi_{\mathbf{k}} = u(\mathbf{r})\exp(-i\mathbf{k}\cdot\mathbf{r})$$

If the wavefunction is expanded in plane waves, then 

$$\Phi_{\mathbf{k}} = \sum_{\mathbf{b}} a_{\mathbf{b}}\exp(-i(\mathbf{k}+\mathbf{b})\cdot\mathbf{r})$$

where k correspond to Bloch waves longer than the unit cell, and b to basis function plane waves 
shorter than the cell (i.e. b > k). 



10.7 A continuum of quantum states: quantum numbers in a crystal 



In a crystal quantum states can be indexed by the Bloch quantum number k. In the LCAO 
approximation, there is a state for each possible atomic orbital at each value of k. As the number 
of electrons tends to infinity, the allowed k's form a continuum. The most important application 
of quantum mechanics in solid state physics is to understand the relationship between energy and 
momentum. A graph of energy vs momentum is called a band structure. 

States are occupied from the lowest energy upwards according to the exclusion principle. The set 
of momenta which correspond to the maximum allowed energy form a surface in the 3-d space - 
the so-called Fermi Surface. 

Shown is the valence "band structure" of dhcp potassium calculated using DFT and pseudopotentials: 
letters are crystallographic notation for values of k ($\Gamma = (0,0,0)$, others are on the edge 
of the Brillouin zone). Note the free electron parabola around $\Gamma$, as $E = \hbar^2k^2/2m$. This structure 
has four layers of atoms per unit cell, so on average there are two bands below the Fermi surface at 
each k-point (each is spin degenerate). There are lots of bands crossing the Fermi level, showing 
that electrons can move from one state to another without requiring energy: potassium is a metal. 
$\Gamma$-A is quite a short distance in k-space, corresponding to waves along the long direction in the 
unit cell: the band structure appears like a parabola "folded back" on itself. 






Figure 6: Band structure of potassium, energy scaled so that $E_F = 0$. x-axis labels denote a path 
through the 3d space of k-vectors. 



11 Fundamentals of Quantum Scattering Theory 



11.1 Centre of Mass Frame and the Two-body Problem 

The problem of a particle in a given potential can be solved classically from Newton's equations. 
The Schroedinger equation can be used to describe the behaviour of one particle in a field. 

The problem of two particles interacting via conservative fields can be reformulated into two parts: 
the behaviour of the centre of mass and the behaviour of the relative velocities of the particles. If 
we work in the centre of mass frame (COM), then the behaviour of the centre of mass is trivial, 
and we need worry only about the relative motions. This can be described by a single effective 
particle with effective mass $\mu = \frac{m_1m_2}{m_1 + m_2}$. This effective particle can then be treated with 
one-particle equations. 
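
A one-line helper makes the two limits of the effective mass easy to check. This is only an illustrative sketch; the function name is hypothetical, and the electron-proton mass ratio of about 1836.15 quoted in the note is a standard value, not something taken from these notes.

```python
def reduced_mass(m1, m2):
    """Effective mass m1*m2/(m1 + m2) of the relative motion of two particles."""
    return m1 * m2 / (m1 + m2)
```

For equal masses the effective mass is half of either mass, whereas for an electron and a proton (`reduced_mass(1.0, 1836.15)` in electron masses) it is within 0.06% of the electron mass, which is why the transformation is slight when the target is heavy.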

The problem of three interacting particles cannot be reduced in this way. Hence the 'three-body 
problem' is in general insoluble. 

The COM transformation allows us to treat the scattering problem as a one body problem. For 
scattering problems we work in the COM frame, describing two real particles as an effective 
particle moving in a potential. Do not forget that for any experiment we will have to apply the 
above transformation to relate theory to the experimental results, though if the target particle is 
much heavier than the other the transformation may be slight. Note also that this transformation 
is invalid if there is an external field. 

11.2 Some terminology for general scattering 

The incident flux (I) of particles with momentum $\mathbf{p} = \hbar\mathbf{k}$ is the number of incident particles 
crossing unit area perpendicular to the beam direction per unit time. 

The scattered flux (S) of particles with momentum $\mathbf{p}' = \hbar\mathbf{k}'$ is the number of scattered particles 
scattered into the element of solid angle $d\Omega$ about the direction $\theta, \phi$ per unit time per unit solid 
angle. 

The differential cross section is the ratio of the scattered flux in direction $\theta, \phi$ to the incident flux. 

The total cross section is the ratio of the scattered flux in any direction to the incident flux. 

$$\sigma_T = \int\!\!\int \frac{d\sigma}{d\Omega}\sin\theta\,d\theta\,d\phi$$



11.3 Scattering in one dimension- Step function 

Firstly, we review the problem of scattering by a step function in one dimension. Consider a 
particle moving from a region (x < 0) where the potential is $V = 0$ to a region (x > 0) where the 
potential is $V = V_0$. 

Assuming the particle energy $E > V_0$, this is simply the free particle problem, the spatial solution 
to which is: 

$$\Phi = A\exp(ikx) + B\exp(-ikx) \quad (x < 0); \qquad \Phi = C\exp(ik'x) + D\exp(-ik'x) \quad (x > 0)$$

where $k = \sqrt{2mE}/\hbar$ and $k' = \sqrt{2m(E - V_0)}/\hbar$. 




Figure 7: Scattering at a step function. 



From the boundary condition that all particles start from $x = -\infty$, we can immediately set $D = 0$. 
From the condition of continuity of $\Phi$ and $d\Phi/dx$ at $x = 0$ we also require $A + B = C$ and 

$$k(A - B) = k'C$$

This gives the reflected amplitude $B/A = (k - k')/(k + k')$ and the transmitted amplitude 
$C/A = 2k/(k + k')$. 



Note that $|A|^2 \neq |B|^2 + |C|^2$. The conserved quantity is the flux of particles, not the probability 
density. In this case the transmitted particles are moving more slowly than the incident ones. 

Notice that if $V_0$ is negative, the transmitted flux gets smaller as $|V_0|$ gets larger: it is difficult 
to fall off a big cliff! This anomaly is due to the unphysical potential - the discontinuity 
at $x = 0$. 

We have not considered the case of $E < V_0$. Now the square root is imaginary and $\Phi(x > 0) = Ce^{-\kappa x}$, 
where we define a real quantity $\kappa = -ik' = \sqrt{2m(V_0 - E)}/\hbar$. The boundary conditions are 
then $A + B = C$ and $ik(A - B) = -\kappa C$, which gives the reflected amplitude $B/A = (ik + \kappa)/(ik - \kappa)$ 
and the transmitted amplitude $C/A = 2ik/(ik - \kappa)$. 

Now the reflected flux is equal to the incident flux, and although the wavefunction penetrates the 
region x > 0, it decays exponentially and there is no propagating wave. 



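For $E > V_0$ the reflected flux is thus 

$$\frac{\hbar k}{m}|B|^2 = \frac{\hbar k}{m}|A|^2\left(\frac{k - k'}{k + k'}\right)^2$$

and the transmitted flux is 

$$\frac{\hbar k'}{m}|C|^2 = \frac{\hbar k'}{m}|A|^2\left(\frac{2k}{k + k'}\right)^2$$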
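
Flux conservation at the step is easy to check numerically. The sketch below evaluates the amplitudes $B/A = (k - k')/(k + k')$ and $C/A = 2k/(k + k')$ and turns them into flux ratios; it relies on Python's `cmath.sqrt` returning $+i\kappa$ for a negative argument, so that $E < V_0$ gives a decaying wave. The function name and the default units $\hbar = m = 1$ are assumptions of this example.

```python
import cmath


def step_coefficients(E, V0, hbar=1.0, m=1.0):
    """Reflected and transmitted flux fractions (R, T) for a potential step.

    Valid for E > 0. For E < V0 the wavenumber k' is imaginary, the wave in
    x > 0 decays, and no flux is transmitted.
    """
    k = cmath.sqrt(2.0 * m * E) / hbar
    k_prime = cmath.sqrt(2.0 * m * (E - V0)) / hbar
    reflected_amplitude = (k - k_prime) / (k + k_prime)     # B/A
    transmitted_amplitude = 2.0 * k / (k + k_prime)         # C/A
    R = abs(reflected_amplitude) ** 2
    # Flux is velocity times density, so weight |C/A|^2 by k'/k.
    T = (k_prime.real / k.real) * abs(transmitted_amplitude) ** 2
    return R, T
```

With this definition R + T = 1 for every energy, even though $|B/A|^2 + |C/A|^2$ is not 1, and making a downward step deeper (more negative `V0`) reduces T.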



11.4 Scattering in one dimension - Square Well 



The square well potential has $V(x < 0) = V(x > a) = 0$; $V(0 < x < a) = V_0$. As with the step 
function, we can write the wavefunction as a plane wave in each of the three regions. 

$$\Phi(x < 0) = A\exp(ikx) + B\exp(-ikx)$$
$$\Phi(0 < x < a) = F\exp(ik'x) + G\exp(-ik'x)$$
$$\Phi(x > a) = C\exp(ikx) + D\exp(-ikx)$$

Once again there is no wave coming back from $x = \infty$ ($D = 0$). 

There are now four boundary conditions from continuity of the wave function and its derivative 
at x=0 and x=a. The solving of four equations in four unknowns is straightforward but tedious. 
Eventually one can obtain ratios for reflected and transmitted flux: 



$$\frac{B}{A} = \frac{(k^2 - k'^2)(1 - e^{2ik'a})}{(k + k')^2 - (k - k')^2e^{2ik'a}}$$

$$\frac{C}{A} = \frac{4kk'e^{i(k'-k)a}}{(k + k')^2 - (k - k')^2e^{2ik'a}}$$

where $k^2 = 2mE/\hbar^2$ and $k'^2 = 2m(E - V_0)/\hbar^2$. Since the wavenumber is the same on both sides 
of the barrier, the reflection and transmission coefficients are just: 

$$|B/A|^2 = \left[1 + \frac{4k^2k'^2}{(k^2 - k'^2)^2\sin^2k'a}\right]^{-1} = \left[1 + \frac{4E(E - V_0)}{V_0^2\sin^2k'a}\right]^{-1}$$

$$|C/A|^2 = \left[1 + \frac{(k^2 - k'^2)^2\sin^2k'a}{4k^2k'^2}\right]^{-1} = \left[1 + \frac{V_0^2\sin^2k'a}{4E(E - V_0)}\right]^{-1}$$



We get complete transmission when $k'a = n\pi$, i.e. when an exact number of half waves fit in the 
well. 



We have assumed that $E > V_0$. Looking at the limits of this, we see that as $E \to V_0$ then 
$\sin^2(k'a) \to (k'a)^2$ and the transmission coefficient 

$$|C/A|^2 \to \left[1 + \frac{mV_0a^2}{2\hbar^2}\right]^{-1}$$

As the incoming particle energy is increased, the transmission oscillates between 
$\left[1 + \frac{V_0^2}{4E(E - V_0)}\right]^{-1}$ and 1 at $k'a = n\pi$. The lower limit itself increases to 1 as E increases. 

For the tunnelling case where $E < V_0$ we can use these solutions for B/A and C/A, except that 
$k'$ is now imaginary. This gives 

$$|C/A|^2 = \left[1 + \frac{V_0^2\sinh^2(|k'|a)}{4E(V_0 - E)}\right]^{-1}$$

which decreases monotonically with decreasing E. Thus a small change in $V_0$ can give a large change 
in $|C/A|^2$. This is the principle on which the transistor and the tunnelling electron microscope 
are based. 

Note that the transmitted wave $\Phi(x > a) = C\exp(ikx)$ differs from the incident wave only by 
a phase - it has the same wavevector. Thus the only effect of the potential on the transmitted 
particles is to change their phase, an idea we shall meet again. 
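
The three regimes (resonant transmission, partial transmission, tunnelling) can all be read off one function. This sketch evaluates $|C/A|^2 = [1 + V_0^2\sin^2(k'a)/4E(E - V_0)]^{-1}$ with a complex $k'$, so that for $E < V_0$ the sine automatically becomes a hyperbolic sine. The function name and the units $\hbar = m = 1$ are assumptions of this example, and the special point $E = V_0$ is deliberately not handled.

```python
import cmath


def transmission(E, V0, a, hbar=1.0, m=1.0):
    """Transmission coefficient |C/A|^2 for a square barrier or well of width a.

    Works for E > V0 (oscillating) and 0 < E < V0 (tunnelling); E == V0 would
    divide by zero and needs the limiting formula instead.
    """
    k_prime = cmath.sqrt(2.0 * m * (E - V0)) / hbar   # imaginary when E < V0
    sin_squared = cmath.sin(k_prime * a) ** 2         # becomes -sinh^2 when E < V0
    ratio = V0 ** 2 * sin_squared / (4.0 * E * (E - V0))
    return (1.0 / (1.0 + ratio)).real
```

Choosing, for example, `E = 1.5`, `V0 = 1.0` and `a = pi` gives $k'a = \pi$ and complete transmission, while any energy below `V0` gives a value that falls off rapidly with barrier width.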



[Figure 8 panels: Resonant Transmission; Partial Transmission; Tunnelling]




Figure 8: Forward moving wavefunctions passing a square well potential 

11.5 The transistor (1956 Nobel) and giant magneto-resistance (2007 Nobel) 

Transistors can be modelled as a barrier potential, with the voltage across them represented by 
different potentials on either side. 

The rapid variation in transmission coefficient (current) with change in potential barrier (voltage) 
is the basis of the transistor. The name comes from 'transfer resistor'. The resistance to motion 
of electrons past the barrier is determined by the voltage $V_0$ in the barrier region more than the 
voltage difference across the transistor. 

Actual behaviour also depends on the availability of electrons for conduction, which depends in 
turn on the material in question, since there must be available electron states of appropriate energy 
on each side of the barrier. 

In GMR a series of barriers is created from layers of ferromagnetic material and a spacer chosen to 
make the layers align antiferromagnetically (e.g. FeCrFe). Conduction electrons with spin opposite 
to the magnetic moment pass easily through iron (there are many states available to them). So 
oppositely aligned layers form a series of barriers to either spin. An external magnetic field applied 
to the GMR device causes all the ferromagnetic layers to align, meaning there is no barrier to anti-aligned 
conduction electrons. Thus a magnetic field causes a change in resistance: GMR heads are used 
to "read" the magnetisation states in computer hard disks. 







12 Scattering in three dimensions 



12.1 Cross sections and geometry 

Most experiments in physics consist of sending one particle to collide with another, and looking 
at what comes out. 

The quantity we can usually measure is the scattering cross section: by analogy with classical 
scattering of hard spheres, we assume that scattering occurs if the particles 'hit' each other. The 
cross section is the apparent 'target area'. The total scattering cross section can be determined by 
the reduction in intensity of a beam of particles passing through a region of 'targets', while the 
differential scattering cross section requires detecting the scattered particles at different angles. 

We will use spherical polar coordinates, with the scattering potential located at the origin and the 
plane wave incident flux parallel to the z direction. In this coordinate system, scattering processes 
are symmetric in $\phi$, so $\frac{d\sigma}{d\Omega}$ will be independent of $\phi$. 

We will also use a purely classical concept, the impact parameter b which is defined as the distance 
of the incident particle from the z-axis prior to scattering. 




Figure 11: Standard spherical coordinate geometry for scattering 



12.2 The Born Approximation 

We can use time-dependent perturbation theory to do an approximate calculation of the cross- 
section. Provided that the interaction between particle and scattering centre is localised to the 
region around r = 0, we can regard the incident and scattered particles as free when they are far 
from the scattering centre. We just need the result that we obtained for a constant perturbation, 
Fermi's Golden Rule, to compute the rate of transitions between the initial state (free particle of 
momentum p) to the final state (free particle of momentum p'). 

We write the Hamiltonian for a single particle being scattered by a fixed potential as 

$$\hat{H} = \hat{H}_0 + \hat{V}(\mathbf{r}) \quad\text{where}\quad \hat{H}_0 = \frac{\hat{p}^2}{2m}, \text{ the kinetic energy operator}$$

and treat the potential energy operator, $\hat{V}(\mathbf{r})$, as the perturbation inducing transitions between 
the eigenstates of $\hat{H}_0$, which are plane waves. 

If we label the initial and final plane-wave states $\Phi_{in} = \exp(i\mathbf{k}\cdot\mathbf{r} - i\omega t)$ and $\Phi_{scat} = \exp(i\mathbf{k}'\cdot\mathbf{r} - i\omega't)$ 
by their respective wave-vectors, then Fermi's Golden Rule for the rate of transitions is 

$$R = \frac{2\pi}{\hbar}|\langle\mathbf{k}'|V|\mathbf{k}\rangle|^2g(E_{k'})$$

where $g(E_{k'})$ is the density of final states; $g(E_{k'})dE_{k'}$ is the number of final states with energy in 
the range $E_{k'} \to E_{k'} + dE_{k'}$. 



The quantity $\langle\mathbf{k}'|V|\mathbf{k}\rangle$ is known as the matrix element of the perturbation and is usually 
abbreviated thus 

$$V_{\mathbf{k}'\mathbf{k}} = \langle\mathbf{k}'|V|\mathbf{k}\rangle = \int\!\!\int\!\!\int\Phi^*_{\mathbf{k}'}(\mathbf{r})V(\mathbf{r})\Phi_{\mathbf{k}}(\mathbf{r})\,d\mathbf{r}.$$

The time variation has been suppressed here. For constant potential, the only non-zero terms 
come from $\omega = \omega'$: elastic scattering. For a time oscillating potential (e.g. $V(\mathbf{r})\sin\omega_0t$) the 
non-zero contribution comes from $\omega = \omega' \pm \omega_0$: inelastic scattering where the scattered particle 
gains/loses a quantum of energy from/to the system providing the potential. 



12.3 Box Normalisation and Density of Final States 



Plane-wave states have wavefunctions of the form: $u_{\mathbf{k},\omega}(\mathbf{r}) = C\exp(i(\mathbf{k}\cdot\mathbf{r} - \omega t))$ with C a 
normalisation constant. Because plane-wave states are not properly normalisable we employ the trick 
of normalising them in a large (relative to potential range) cubic box of side L with periodic 
boundary conditions. We then take the limit $L \to \infty$ at the end of the calculation. 

Thus we require that 

$$\int\!\!\int\!\!\int_{box}u^*_{\mathbf{k},\omega}u_{\mathbf{k},\omega}\,d\mathbf{r} = |C|^2\int\!\!\int\!\!\int_{box}d\mathbf{r} = |C|^2L^3 = 1$$

giving for the normalised eigenfunctions: $u_{\mathbf{k},\omega}(\mathbf{r}) = L^{-3/2}\exp(i(\mathbf{k}\cdot\mathbf{r} - \omega t))$ 

Of course, enclosing the system in a finite box has the consequence that the allowed momentum 
eigenvalues are no longer continuous but discrete. With periodic boundary conditions 

$$u\left(-\tfrac{L}{2}, y, z\right) = u\left(\tfrac{L}{2}, y, z\right), \text{ etc.}$$

the momentum eigenvalues are forced to be of the form 

$$\mathbf{p} = \hbar\mathbf{k} = \frac{2\pi\hbar}{L}(n_x, n_y, n_z), \text{ with } n_x, n_y, n_z = 0, \pm1, \pm2, \ldots$$

For sufficiently large L, we can approximate the continuous spectrum arbitrarily closely. 

Any possible final-state wave-vector, $\mathbf{k}'$, corresponds to a point in wave-vector space with 
coordinates $(k_x, k_y, k_z)$. The points form a cubic lattice with lattice spacing $2\pi/L$. Thus the volume of 
k-space per lattice point is $(2\pi/L)^3$, and the number of states in a volume element $d^3k'$ is 

$$\left(\frac{L}{2\pi}\right)^3d^3k' = \left(\frac{L}{2\pi}\right)^3k'^2\,dk'\,d\Omega$$

We require $g(E_{k'})$, the density of states per unit energy, where $E_{k'} = \hbar^2k'^2/2m$ is the energy 
corresponding to wave-vector $\mathbf{k}'$. Now, the wave-vectors in the range $\mathbf{k}' \to \mathbf{k}' + d^3k'$ correspond 
to the energy range $E_{k'} \to E_{k'} + dE_{k'}$, so that 

$$g(E_{k'})\,dE_{k'} = \left(\frac{L}{2\pi}\right)^3k'^2\,dk'\,d\Omega$$

is the number of states with energy in the desired interval and with wave-vector, $\mathbf{k}'$, pointing into 
the solid angle $d\Omega$ about the direction $(\theta, \phi)$. Noting that $dE_{k'} = (\hbar^2k'/m)\,dk'$ yields the final result 
for the density of states, 

$$g(E_{k'}) = \frac{L^3}{8\pi^3}\frac{mk'}{\hbar^2}\,d\Omega$$

12.4 Incident and Scattered Flux 



The box normalisation corresponds to one particle per volume $L^3$, so that the number of particles 
crossing unit area perpendicular to the beam per unit time is just given by the magnitude of the 
incident velocity divided by $L^3$: 

$$\text{incident flux} = \frac{|\mathbf{p}|/m}{L^3} = \frac{\hbar k}{mL^3}$$

Using the Golden Rule, we have that the rate of transitions between the initial state of wave-vector 
$\mathbf{k}$ and final states whose wave-vectors $\mathbf{k}'$ lie in the element of solid angle $d\Omega$ about the direction 
$(\theta, \phi)$ of the wave-vector $\mathbf{k}'$, is given by 

$$R = \frac{2\pi}{\hbar}|\langle\mathbf{k}'|V|\mathbf{k}\rangle|^2\,\frac{L^3}{8\pi^3}\frac{mk'}{\hbar^2}\,d\Omega$$

but this is just the number of particles scattered into $d\Omega$ per unit time. To get the scattered flux 
we simply divide by $d\Omega$ to get the number per unit time per unit solid angle. 

12.5 The Differential Cross-Section 

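We now have all the ingredients, the scattered flux and the incident flux, to compute the 
cross-section: 

$$\frac{d\sigma}{d\Omega} = \frac{\text{scattered flux}}{\text{incident flux}} = \frac{mL^3}{\hbar k}\,\frac{2\pi}{\hbar}|\langle\mathbf{k}'|V|\mathbf{k}\rangle|^2\,\frac{L^3}{8\pi^3}\frac{mk'}{\hbar^2}$$

Noting that, for elastic scattering, $k' = k$, we obtain finally the so-called Born approximation for 
the differential cross-section: 

$$\frac{d\sigma}{d\Omega} = \frac{m^2L^6}{4\pi^2\hbar^4}|\langle\mathbf{k}'|V|\mathbf{k}\rangle|^2$$

where the matrix element $V_{\mathbf{k}'\mathbf{k}} = \langle\mathbf{k}'|V|\mathbf{k}\rangle$ is given by 

$$\langle\mathbf{k}'|V|\mathbf{k}\rangle = \frac{1}{L^3}\int\!\!\int\!\!\int V(\mathbf{r})\exp(-i\boldsymbol{\chi}\cdot\mathbf{r})\,d\mathbf{r}$$

with $\boldsymbol{\chi} = \mathbf{k}' - \mathbf{k}$, the so-called wave-vector transfer. Thus the required matrix element in the Born 
approximation is just the 3-dimensional Fourier transform of the potential energy function. The 
total scattering cross section is simply: 

$$\sigma_T = \int\frac{d\sigma}{d\Omega}\,d\Omega = \int\!\!\int\frac{d\sigma}{d\Omega}\sin\theta\,d\theta\,d\phi$$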



Observe that the final result for the differential cross-section is independent of the box size, L, 
which we used to normalise the plane-wave states. 

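12.6 Further Simplification to 1D for Conservative, Central Potential 

Consider a central potential $V(\mathbf{r}) = V(|\mathbf{r}|)$ where energy is conserved, $|\mathbf{k}'|^2 = |\mathbf{k}|^2$. Here $\boldsymbol{\chi}$ is a 
vector of length $2k\sin\frac{\theta}{2}$ where $\theta$ is the scattering angle. 

We can make some progress with the matrix element integral if we choose a polar coordinate 
system with $\boldsymbol{\chi}$ along the z-axis, so that $\boldsymbol{\chi}\cdot\mathbf{r} = \chi r\cos\theta'$, where $\theta'$ is the polar angle of $\mathbf{r}$ (not the 
scattering angle). Since we are trying to integrate over all space this change does not affect the 
limits of the integral. 

$$L^3V_{\mathbf{k}'\mathbf{k}} = \int_0^{2\pi}d\phi\int_{-1}^{+1}\int_0^\infty V(r)e^{-i\chi r\cos\theta'}r^2\,dr\,d(\cos\theta')$$

$$= 2\pi\int_0^\infty\frac{e^{-i\chi r} - e^{i\chi r}}{-i\chi r}V(r)r^2\,dr = \frac{4\pi}{\chi}\int_0^\infty rV(r)\sin(\chi r)\,dr$$

But since $|\mathbf{k}| = |\mathbf{k}'|$, $|\boldsymbol{\chi}| = 2k\sin\frac{\theta}{2}$, whence we obtain the most useful form of the Born 
approximation: 

$$\frac{d\sigma}{d\Omega} = \frac{m^2}{\hbar^4k^2\sin^2\frac{\theta}{2}}\left|\int_0^\infty rV(r)\sin\left(2kr\sin\frac{\theta}{2}\right)dr\right|^2$$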



Thus the scattering cross-section is independent of $\phi$ (due to cylindrical symmetry of the problem). 
Note that this shows that the differential cross section does not depend on scattering angle and 
beam energy independently, but on a single parameter $\chi$. By using a range of energies for the 
incoming particles, k, this dependence can be used to test whether experimental data can be well 
described by the Born Approximation. 
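
The dependence on the single parameter $\chi$ can be checked numerically. The sketch below evaluates the radial Born integral by Simpson's rule for any short-ranged central potential supplied as a Python function; the function name, the cut-off `r_max`, the units $\hbar = m = 1$ and the Gaussian test potential mentioned afterwards are all assumptions of this example.

```python
import math


def born_dsigma_central(V, r_max, k, theta, n=2000, hbar=1.0, m=1.0):
    """Born differential cross section for a central potential V(r).

    Evaluates (2m / hbar^2 chi)^2 |integral_0^r_max r V(r) sin(chi r) dr|^2
    with chi = 2 k sin(theta/2), using Simpson's rule with n (even) intervals.
    theta must be non-zero, and V must be negligible beyond r_max.
    """
    chi = 2.0 * k * math.sin(0.5 * theta)
    h = r_max / n
    total = 0.0
    for i in range(n + 1):
        r = i * h
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * r * V(r) * math.sin(chi * r)
    integral = total * h / 3.0
    return (2.0 * m / (hbar ** 2 * chi)) ** 2 * integral ** 2
```

For a hypothetical Gaussian potential `V = lambda r: math.exp(-r * r)` the integral is known in closed form, giving $(\pi/4)e^{-\chi^2/2}$ in these units, and two different beam energies give the same cross section whenever the angles are chosen so that $\chi$ is the same.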

The most common use of the Born approximation is, of course, in reverse. Having found $\frac{d\sigma}{d\Omega}$ 
experimentally, a reverse Fourier transform can be used to obtain the form of the potential. 



12.7 Example of Born Approximation 

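Consider scattering of particles interacting via a 3D square well potential: $V(r < a) = V_0$; 
$V(r > a) = 0$. 

The integral required here is then (with $\chi = 2k\sin\frac{\theta}{2}$): 

$$\int_0^a rV_0\sin(\chi r)\,dr = V_0\left[\frac{\sin(\chi r) - \chi r\cos(\chi r)}{\chi^2}\right]_0^a$$

whence: 

$$\frac{d\sigma}{d\Omega} = \left(\frac{2mV_0}{\chi\hbar^2}\right)^2\left[\frac{\sin(\chi a) - \chi a\cos(\chi a)}{\chi^2}\right]^2$$

Using a Maclaurin expansion, the low energy limit is: 

$$\frac{d\sigma}{d\Omega} \approx \left(\frac{2mV_0a^3}{3\hbar^2}\right)^2\left(1 - \frac{\chi^2a^2}{5} + \ldots\right)$$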

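From integrating over $\theta$ and $\phi$ the low and high energy limits for the total cross section are 

$$\sigma(E \to \infty) = 2\pi\left(\frac{mV_0a^3}{\hbar^2}\right)^2\left(\frac{1}{ka}\right)^2$$

$$\sigma(E \to 0) = 2\pi\left(\frac{mV_0a^3}{\hbar^2}\right)^2\frac{8}{9}\left(1 - \frac{2}{5}(ka)^2\right)$$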
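
Both limits can be checked by integrating the Born differential cross section for the square well numerically. The sketch below works in units $\hbar = m = 1$ (an assumption of this example), uses the closed form $(2mV_0/\hbar^2)^2[(\sin\chi a - \chi a\cos\chi a)/\chi^3]^2$, and switches to the Maclaurin series at small $\chi a$ to avoid cancellation; the function names are hypothetical.

```python
import math


def born_dsigma(theta, k, V0, a, hbar=1.0, m=1.0):
    """Born differential cross section for a 3D square well of radius a."""
    chi = 2.0 * k * math.sin(0.5 * theta)
    x = chi * a
    if x < 1e-2:
        # Maclaurin series of (sin x - x cos x) / chi^3
        shape = a ** 3 * (1.0 / 3.0 - x * x / 30.0 + x ** 4 / 840.0)
    else:
        shape = (math.sin(x) - x * math.cos(x)) / chi ** 3
    return (2.0 * m * V0 / hbar ** 2 * shape) ** 2


def total_cross_section(k, V0, a, n=20000):
    """Integrate dsigma/dOmega over the sphere with Simpson's rule in theta."""
    h = math.pi / n
    total = 0.0
    for i in range(n + 1):
        theta = i * h
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * born_dsigma(theta, k, V0, a) * math.sin(theta)
    return 2.0 * math.pi * total * h / 3.0     # the phi integral gives 2 pi
```

At small `k` the result approaches $(16\pi/9)(mV_0a^3/\hbar^2)^2(1 - \frac{2}{5}k^2a^2)$, and at large `k` it falls off as $2\pi(mV_0a^2/\hbar^2)^2/k^2$, i.e. inversely with the energy.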



12.8 General Notes on Scattering in the Born Approximation 

The square well illustrates some general features of scattering in the Born approximation: 

• Born approximation is based on perturbation theory, so works best for high energy particles. 

• Scattering depends on $V_0^2$, so both attractive and repulsive potentials behave the same. 

• At high energy, cross section is inversely proportional to the energy ($E = \hbar^2k^2/2m$). 

• Dependence on k and $\theta$ arises only through the combination $\chi = 2k\sin\frac{\theta}{2}$. Thus as energy 
increases, the scattering angle $\theta$ is reduced and the scattered beam becomes more peaked in 
the 'straight on' direction. 

• Angular dependence depends on the range of the potential a but not on the strength $V_0$. 

• Total cross section depends on both range a and depth $V_0$ of the potential. 



13 Further Concepts in Quantum Scattering Theory 

13.1 Born Series, Green Functions - A Hint of Quantisation of the Field 

Solving the Schroedinger equation using Green Functions automatically gives a solution in a form 
appropriate for scattering. By making the substitution $E = \hbar^2k^2/2\mu$ and $U(\mathbf{r}) = (2\mu/\hbar^2)V(\mathbf{r})$ we 
can write the TISE as: 

$$[\nabla^2 + k^2]\Phi = U(\mathbf{r})\Phi$$

For $U(\mathbf{r}) = 0$ this gives $\phi_0(\mathbf{r}) = Ae^{i\mathbf{k}\cdot\mathbf{r}}$, a travelling wave. We now introduce a 'Green's Function' 
for the operator $[\nabla^2 + k^2]$, which is the solution to the equation: 

$$[\nabla^2 + k^2]G(\mathbf{r}) = \delta(\mathbf{r}), \qquad G(\mathbf{r}) = -\frac{\exp(ikr)}{4\pi r}$$

$\delta(\mathbf{r})$ is the Dirac delta-function; the source term is singular because $G(\mathbf{r})$ diverges at the origin. 
$G(\mathbf{r})$ has the property that any function $\Phi$ which satisfies 

$$\Phi(\mathbf{r}) = \phi_0(\mathbf{r}) + \int G(\mathbf{r} - \mathbf{r}')U(\mathbf{r}')\Phi(\mathbf{r}')\,d^3r'$$

where $\phi_0(\mathbf{r})$ is the free particle solution, will be a solution to the TISE. Since $\phi_0(\mathbf{r})$ is the 
unscattered incoming wave, the second term must represent the scattered wave. 

Thus the general solution to the TISE is given by: 

$$\Phi(\mathbf{r}) = Ae^{i\mathbf{k}\cdot\mathbf{r}} + \int G(\mathbf{r} - \mathbf{r}')U(\mathbf{r}')\Phi(\mathbf{r}')\,d^3r'$$

In this expression, $\Phi$ appears on both sides. We can substitute for $\Phi$ using the same equation: 

$$\Phi(\mathbf{r}) = Ae^{i\mathbf{k}\cdot\mathbf{r}} + \int G(\mathbf{r} - \mathbf{r}')U(\mathbf{r}')Ae^{i\mathbf{k}\cdot\mathbf{r}'}\,d^3r' + \int\!\!\int G(\mathbf{r} - \mathbf{r}')U(\mathbf{r}')G(\mathbf{r}' - \mathbf{r}'')U(\mathbf{r}'')\Phi(\mathbf{r}'')\,d^3r'\,d^3r''$$

Repeated substitution gives the Born series, terminated by a term involving $\Phi(\mathbf{r})$ itself. If the 
potential is weak, the higher order terms can be ignored. The first order term is just the matrix 
element between the incoming plane wave and the Green function: the Born approximation again! 
If we think of the potential U as an operator, the first term represents the incoming wavefunction 
being operated on once. The second term represents the incoming wavefunction being operated 
on twice. And so forth. This suggests a way of quantising the effect of the field: the first order 
term corresponds to a single scattering event, the second order term to double scattering etc. 

[Figure 12 terms: No Scattering + Single Scattering ~ U + Double Scattering ~ U^2 + Multiple Scattering]

Figure 12: Born Series - scattering as series of terms 
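
The structure of the Born series, repeated substitution of the solution into itself, can be seen in a finite-dimensional toy problem. In this sketch a small matrix `K` stands in for the integral operator built from G and U, and a short vector `phi0` for the incoming wave; these stand-ins, and the function name, are assumptions of the example and not a discretisation of the real 3D problem.

```python
def born_series(phi0, K, order):
    """Iterate psi = phi0 + K psi, starting from the unscattered wave phi0.

    order = 0 returns phi0 (no scattering), order = 1 adds single scattering,
    order = 2 adds double scattering, and so on.
    """
    size = len(phi0)
    psi = list(phi0)
    for _ in range(order):
        psi = [phi0[i] + sum(K[i][j] * psi[j] for j in range(size))
               for i in range(size)]
    return psi
```

When the stand-in operator is weak (all eigenvalues of `K` well inside the unit circle) the iterates converge quickly to the exact solution of $(1 - K)\psi = \phi_0$, mirroring the statement that higher order terms can be ignored for a weak potential.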






13.2 Scattering of distinguishable particles and identical particles 

Consider two beams of distinguishable particles with the same mass colliding, and scattering 
through some angle $\theta$. Let the intensity of the scattered particles have angular dependence $|f(\theta)|^2$. 
Conservation of energy and momentum ensure that the scattering angles are the same for both 
particles in the COM frame. As usual, the radial part of the wavefunction far from the region of 
interaction is simply a plane wave so the wavefunction can be written as a function of $\theta$. 

The intensity for the process in which both particles are scattered through an angle $(\pi - \theta)$ is 
$|f(\pi - \theta)|^2$. Note that this process results in particles arriving in the same places as with $f(\theta)$ - 
it is just the other particles (see diagram). 

If the two particle beams are distinguishable they cannot interfere and the differential cross section 
for either particle to be detected at $\theta$ is: 

$$I_{dis} = |f(\theta)|^2 + |f(\pi - \theta)|^2$$

If, however, the particles are indistinguishable bosons (fermions), they can interfere and the 
combined wavefunction must be (anti)symmetric under exchange of labels: 

$$\Phi(\theta) = f(\theta) \pm f(\pi - \theta), \qquad I_{ind} = |f(\theta) \pm f(\pi - \theta)|^2$$

Taking the specific extreme example of scattering through $\pi/2$, the differential cross section is 
$2|f(\pi/2)|^2$ for distinguishable particles, $4|f(\pi/2)|^2$ for identical bosons, and 0 for identical fermions. 
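
The three cases differ only in whether amplitudes or intensities are added, which a few lines make explicit. The function name and the convention of passing the two amplitudes $f(\theta)$ and $f(\pi - \theta)$ as complex numbers are assumptions of this sketch.

```python
def cross_sections(f_theta, f_pi_minus_theta):
    """Differential cross sections for the three kinds of particle pair.

    Returns (distinguishable, identical bosons, identical fermions) given
    the complex amplitudes f(theta) and f(pi - theta).
    """
    distinguishable = abs(f_theta) ** 2 + abs(f_pi_minus_theta) ** 2
    bosons = abs(f_theta + f_pi_minus_theta) ** 2      # symmetric combination
    fermions = abs(f_theta - f_pi_minus_theta) ** 2    # antisymmetric combination
    return distinguishable, bosons, fermions
```

At $\theta = \pi/2$ the two amplitudes coincide, so a unit amplitude gives 2, 4 and 0 respectively, as stated above.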




Figure 13: Two indistinguishable scattering processes. 



13.3 Scattering of indistinguishable particles into the same state 

Consider scattering of two indistinguishable bosons by an external potential. The wavefunction 
describing the bosons must be symmetric with respect to exchange. Thus the cross section for 
scattering of both through the same angle is: $|2f(\theta)|^2$: two bosons are twice as likely to be 
scattered into the same state as two distinguishable particles. For many bosons the effect is even 
more pronounced, and the probability of scattering out of the state is similarly reduced. 

The tendency for bosons to clump into one state leads to superfluid behaviour in $^4$He and 
superconductivity: $\alpha$ particles and Cooper pairs behave as bosons. All the particles are in the same 
state and cannot be scattered out. 

For fermions, the cross section for being scattered into the same state is $|f(\theta) - f(\theta)|^2 = 0$, as we 
would expect from the exclusion principle. 






13.4 Collision between two unpolarised electron beams 



In this case, half the collisions will be between like-polarised electrons, so will involve interference, 
and half will be between unlike electrons: so there would be no interference. In both cases $|f(\theta)|^2$ 
represents Rutherford scattering. The differential cross section for finding an electron scattered 
through an angle $\theta$ is thus: 

$$I = \frac{1}{2}(I_{dis} + I_{ind}) = \frac{1}{2}\left(|f(\theta)|^2 + |f(\pi - \theta)|^2\right) + \frac{1}{2}|f(\theta) - f(\pi - \theta)|^2$$

Consider $\theta = \pi/2$. The like polarised beams give zero probability, so unpolarised beams give only 
half what we would expect from Coulomb scattering of distinguishable particles. Furthermore, 
the spins of pairs of electrons scattered through $\theta = \pi/2$ are always observed to be opposite. 

An alternate philosophy is that we should treat the spins as a symmetric triplet and an 
antisymmetric singlet, with probabilities $\frac{3}{4}$ and $\frac{1}{4}$. Then the spatial scattering process must be 
antisymmetric in the first case and symmetric in the second. This gives the same answer! 
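
That the two philosophies agree can be confirmed for arbitrary complex amplitudes. In this sketch `f` and `g` stand for $f(\theta)$ and $f(\pi - \theta)$; the function names are hypothetical.

```python
def unpolarised_by_beams(f, g):
    """Half the collisions distinguishable, half like-spin (antisymmetric)."""
    return 0.5 * (abs(f) ** 2 + abs(g) ** 2) + 0.5 * abs(f - g) ** 2


def unpolarised_by_spin_states(f, g):
    """Weight 3/4 triplet (antisymmetric space part), 1/4 singlet (symmetric)."""
    return 0.75 * abs(f - g) ** 2 + 0.25 * abs(f + g) ** 2
```

Expanding both expressions gives $|f|^2 + |g|^2 - \mathrm{Re}(f^*g)$, so they agree for any pair of amplitudes; at $\theta = \pi/2$, where $f = g$, both reduce to $|f|^2$, half the distinguishable value of $2|f|^2$.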



13.5 Scattering of identical free particles with a periodic potential 

For a free particle moving in a 1D region of space there are two degenerate wavefunctions 
($\Phi = e^{\pm ikx}$). If there is a weak periodic potential, $V\cos ax$, then to evaluate the energy shift to 
first order in degenerate perturbation theory the relevant matrix elements are: 

$$\int e^{\pm ikx}V\cos ax\,e^{\mp ikx}\,dx = \int V\cos ax\,dx = 0; \qquad \int e^{\pm ikx}V\cos ax\,e^{\pm ikx}\,dx = \int V\cos ax\cos 2kx\,dx$$

The second term is also zero, except in the case $2k = a$. This gives rise to the remarkable result: 
To first order, free particles are unaffected by a periodic potential unless it has half the wavelength. 
This is the basis of Bragg's Law, x-ray and neutron diffraction. 
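
The selection rule $2k = a$ can be seen by evaluating the second matrix element numerically over a box holding a whole number of periods of the potential. The box length, the grid and the normalisation per unit length are choices made for this sketch, and the function name is hypothetical.

```python
import math


def coupling(k, a, V=1.0, n_periods=20, n_grid=4000):
    """Average of V cos(ax) cos(2kx) over a box of n_periods potential periods.

    The result is V/2 when 2k = a and zero for any other k that fits the
    periodic box.
    """
    length = n_periods * 2.0 * math.pi / a
    dx = length / n_grid
    total = sum(V * math.cos(a * i * dx) * math.cos(2.0 * k * i * dx)
                for i in range(n_grid))
    return total * dx / length
```

With `a = 1.0`, the value `k = 0.5` gives 0.5 while `k = 0.3` gives zero to rounding error: only the state with half the wavenumber of the potential is coupled to its degenerate partner.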

13.6 Scattering of free electrons in metals 

If we describe an electron bound in a solid or liquid as a free electron, we see that scattering occurs 
only for those electrons with wavenumbers close to periodic repeats. For simple metals (Li, Na 
etc) the highest occupied free-electron level has wavelength greater than any crystal spacing, so 
it only sees the average of the ionic potential. 

To first order, only electrons with the periodicity of the lattice are scattered. To second order in
perturbation theory, the potential can mix states:

$$\Delta E = \sum_j{}^{'} \frac{|V_{ij}|^2}{(E_j - E_i)} ; \qquad V_{ij} = \int e^{\pm i(a/2+\delta)x}\, V\cos ax\; e^{\pm i(a/2-\delta)x}\, dx$$

which gives significant energy shifts for states $\pm\delta$ from the lattice periodicity ($E_j - E_i = -\hbar^2 a\delta/m$).
Thus free-electron levels with $k \approx a/2$ are split by periodic potentials giving a bandgap in the
density of allowed states. At first glance, this may seem to be totally different physics from the
LCAO band gaps we saw earlier. In fact, it's simply another manifestation of using two different
mathematical basis sets to describe the same physical phenomenon.
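
The opening of a gap near $k \approx a/2$ can be sketched with a two-state model. This is not the
second-order formula itself: it is an exact diagonalisation restricted to the two plane waves $k$ and
$k - a$, which is an assumption of the example, as are the units $\hbar^2/m = 1$, the coupling $V/2$
between normalised plane waves, and the function names.

```python
import math


def two_state_levels(delta, a, V):
    """Energies of the two mixed plane waves k = a/2 + delta and k - a, in units hbar^2/m = 1."""
    k_i = a / 2.0 + delta
    k_j = k_i - a  # equals -(a/2 - delta)
    e_i = 0.5 * k_i ** 2
    e_j = 0.5 * k_j ** 2
    coupling = V / 2.0  # matrix element of V cos(ax) between normalised plane waves
    mean = 0.5 * (e_i + e_j)
    half_difference = 0.5 * (e_i - e_j)
    root = math.hypot(half_difference, coupling)
    return mean - root, mean + root


def band_gap(a, V):
    """Splitting exactly at k = a/2, where the two free-particle levels are degenerate."""
    lower, upper = two_state_levels(0.0, a, V)
    return upper - lower
```

In this model the gap at $k = a/2$ equals $V$, and well away from the degeneracy the shift of
each level approaches $|V_{ij}|^2$ divided by the free-particle energy difference, $\hbar^2 a\delta/m$ in magnitude.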

13.7 Low energy Scattering: Partial Waves 

The Born Approximation is a perturbation method based on the Fermi Golden Rule and is therefore
valid when the incoming particle energy is large compared to the potential. An alternative
approach is needed at low energy. For a central potential, the scattering geometry (plane wave in,
radial wave out) implies a wavefunction:

$$|\Psi\rangle = \text{Incident Wave} + \text{Scattered Wave} = e^{iKz} + f(\theta)\,e^{iKr}/r$$

The incident flux is $I = v\,e^{iKz}e^{-iKz} = v = \hbar K/m$. The scattered flux must be a normalisable spherical
wave (hence $e^{iKr}/r$), with a $\theta$ dependence arising from the scattering. By symmetry, there is no
$\phi$ dependence. Thus the scattered flux per unit area will be $v f^*(\theta)f(\theta)/r^2$, i.e.
$S(\theta) = v f^*(\theta)f(\theta)$ per unit solid angle. The cross section is
$d\sigma/d\Omega = S(\theta)/I = f^*(\theta)f(\theta)$, and all we need do is solve the Schroedinger equation and calculate
$f(\theta)$.

For a spherically symmetric potential, the angular parts of the wavefunction are simply spherical
harmonics, so scattering is described by the radial equation:

$$\frac{d^2u_l(r)}{dr^2} - \frac{l(l+1)}{r^2}u_l(r) + \frac{2\mu}{\hbar^2}\left[E - V(r)\right]u_l(r) = 0$$

where $u_l(r) = rR_l(r)$, the same substitution as in the atomic hydrogen problem. Assuming a short
range potential, $V(r \to \infty) = 0$, $R_l(Kr \to \infty)$ describes a free particle, with some phase $l\pi/2 - \delta_l$:

$$R_l(Kr) = \sin(Kr - l\pi/2 + \delta_l)/Kr$$

Thus the effect of the scattering at long range can be described by a set of phase shifts $\delta_l$.

To solve further, we expand a plane wave into angular momentum components using a complete
set of spherical harmonics and Bessel Functions:

$$\exp(iKr\cos\theta) = \sum_{l=0}^{\infty} i^l j_l(Kr)(2l+1)P_l(\cos\theta)$$

so that we can write:

$$\Psi(r) = e^{iKz} + f(\theta)\frac{e^{iKr}}{r} = \sum_{l=0}^{\infty} i^l j_l(Kr)(2l+1)P_l(\cos\theta) + f(\theta)\frac{e^{iKr}}{r} = \sum_{l=0}^{\infty} b_l R_l(Kr)P_l(\cos\theta)$$

where $b_l$ are expansion coefficients for the expression of $\Psi$ in the partial wave basis, which can be
determined from the boundary condition as $r \to \infty$, giving:

$$f(\theta) = K^{-1}\sum_{l=0}^{\infty}(2l+1)\,e^{i\delta_l}\sin\delta_l\,P_l(\cos\theta)$$



From this we can calculate $d\sigma/d\Omega = |f(\theta)|^2$ and $\sigma = 2\pi\int_0^{\pi}|f(\theta)|^2\sin\theta\,d\theta$. Differential cross sections
$d\sigma/d\Omega$ are complicated, involving many cross terms. However, when integrated over all $\theta$ these
cross terms vanish due to orthogonality of the Legendre polynomials, $\langle P_l|P_{l'}\rangle = 0$ ($l \neq l'$), and

$$\sigma = \frac{4\pi}{K^2}\sum_{l=0}^{\infty}(2l+1)\sin^2\delta_l$$

Hence scattering cross sections are completely determined by $|K|$ and the phase shifts $\delta_l$. This is
most useful in the low energy limit (S-wave scattering) where any particle with $l > 0$ must be so
far from the target (impact parameter $b \approx l/K$) that it will miss.

Note the term $(2l+1)$. This can be related to the classical 'impact parameter' mentioned above.
The angular momentum of a particle of velocity $v$ is $mvb = \sqrt{l(l+1)}\,\hbar$. Thus a classical (large $l$)
particle with angular momentum $l\hbar$ would pass between a ring of radius $b = l\hbar/mv$ and one of
radius $b = (l+1)\hbar/mv$. The area between these rings is $(2l+1)\pi(\hbar/mv)^2$ so for a uniform beam
the probability of a particle having angular momentum $l$ is proportional to $(2l+1)$.
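
The partial-wave formulae can be exercised with a short sketch. It assumes a finite, user-supplied
list of phase shifts $\delta_0, \delta_1, \dots$ (purely illustrative values, not derived from any potential)
and hypothetical function names. It builds $f(\theta) = K^{-1}\sum_l (2l+1)e^{i\delta_l}\sin\delta_l P_l(\cos\theta)$
and compares $2\pi\int|f|^2\sin\theta\,d\theta$ with $\frac{4\pi}{K^2}\sum_l(2l+1)\sin^2\delta_l$.

```python
import cmath
import math


def legendre(l, x):
    """Legendre polynomial P_l(x) from the standard three-term recurrence."""
    previous, current = 1.0, x
    if l == 0:
        return previous
    for n in range(1, l):
        previous, current = current, ((2 * n + 1) * x * current - n * previous) / (n + 1)
    return current


def scattering_amplitude(theta, K, phase_shifts):
    """f(theta) = (1/K) * sum_l (2l+1) exp(i delta_l) sin(delta_l) P_l(cos theta)."""
    x = math.cos(theta)
    terms = (
        (2 * l + 1) * cmath.exp(1j * delta) * math.sin(delta) * legendre(l, x)
        for l, delta in enumerate(phase_shifts)
    )
    return sum(terms) / K


def total_cross_section(K, phase_shifts):
    """sigma = (4 pi / K^2) * sum_l (2l+1) sin^2(delta_l)."""
    return 4.0 * math.pi / K ** 2 * sum(
        (2 * l + 1) * math.sin(delta) ** 2 for l, delta in enumerate(phase_shifts)
    )


def integrated_cross_section(K, phase_shifts, samples=4000):
    """Midpoint-rule estimate of 2 pi * integral_0^pi |f(theta)|^2 sin(theta) d(theta)."""
    step = math.pi / samples
    total = 0.0
    for i in range(samples):
        theta = (i + 0.5) * step
        total += abs(scattering_amplitude(theta, K, phase_shifts)) ** 2 * math.sin(theta)
    return 2.0 * math.pi * total * step
```

The two cross sections agree to within the integration error, which reflects the vanishing of the
cross terms between different $l$.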



14 Using Partial Waves 

14.1 Impact Parameter and Classical Analogies 




Figure 14: Relation between classical and quantum angular momentum 

Knowing the impact parameter gives us some classical idea of whether a scattering event is likely. 
If the impact parameter is larger than the range of the potential, then classically the particles 
would miss. In the quantum case, we expect this to mean that the phase shift for that angular 
momentum is zero, and hence that the contribution from that term in the expansion is zero. 
Thus at a given incoming momentum, $\hbar k$, we can determine how many terms in the partial wave
expansion to consider from $\hbar k b_{max} \approx l_{max}\hbar$, where $b_{max}$ is the maximum impact parameter for
classical collision, i.e. the range of the potential.



14.2 S-wave scattering 

Although exact at all energies, the partial wave method is most useful for dealing with scattering of
low energy particles. This is because for slow moving particles to have large angular momentum
($\hbar kb$) they must have large impact parameters $b$. Classically, particles with impact parameter
larger than the range of the potential miss the potential. Thus for scattering of slow-moving
particles we need only consider a few partial waves; all the others are unaffected by the potential
($\delta_l \approx 0$). Thus partial waves and the Born approximation are complementary methods, good for
slow and fast particles respectively.

For very low energy we need consider only the first term in the partial wave expansion. This is
known as S-wave scattering. In this case it is possible to solve for the differential cross section, since
only the first term in the series for $f(\theta)$ is involved:

$$\frac{d\sigma}{d\Omega} = |f(\theta)|^2 = K^{-2}\sin^2\delta_0$$

Since the angular variation is $P_0(\cos\theta) = 1$ the scattering is isotropic.

At higher energies, other angular momentum components come into play. For a given $l$ component,
scattering is maximised for $\delta_l = \pi/2$.



14.3 Resonance 



In some cases where a potential has a bound state of particular angular momentum, the scattering
of particles with that angular momentum will be especially enhanced. In such cases the total
scattering cross section will show a peak, and the angular distribution will be characteristic of the
appropriate $P_l(\cos\theta)$. This very strong scattering is known as resonance and is a powerful method
for studying bound states.

14.4 Example of S-wave scattering - Attractive square well potential 

An example where we can solve for the phase shift is the 3D square well potential:

$$V(r < R) = -V_0 ; \qquad V(r > R) = 0$$

For the $l = 0$ case the radial equation with $u_0 = rR_0$ is

$$\frac{d^2u_0(r)}{dr^2} + \frac{2\mu}{\hbar^2}\left[E - V(r)\right]u_0(r) = 0$$

The solutions to this are familiar from the 1D square well. If we write

$$K_0 = \sqrt{2\mu[E + V_0]}/\hbar ; \qquad K = \sqrt{2\mu E}/\hbar$$

then for $r < R$, $u(r) = A\sin K_0r + B\cos K_0r$, where $B = 0$ because $u = rR$ must vanish at the origin,

and for $r > R$, $u(r) = C\sin Kr + D\cos Kr$, which can easily be written in a different form to
show the appropriate phase shift $\delta_0$: $u(r) = F\sin(Kr + \delta_0)$ where ($C = F\cos\delta_0$; $D = F\sin\delta_0$).

As with the 1D square well, the boundary conditions are that $u$ and $\frac{du}{dr}$ are continuous at $R$, which
lead to:

$$K\tan K_0R = K_0\tan(KR + \delta_0) \quad\text{or}\quad \delta_0 = \tan^{-1}\left(\frac{K}{K_0}\tan K_0R\right) - KR$$

In the low energy case $KR \ll 1$, we obtain maximum scattering ($\sin^2\delta_0 = 1$) when $K_0R =
(n+\frac{1}{2})\pi$, when the scattering cross section is $\sigma = 4\pi/K^2$. This is an example of s-wave resonance.

In the same slow particle limit $K \ll K_0$, and assuming that $\tan K_0R$ is not very large, $\delta_0 \approx \sin\delta_0$ and

$$\sigma \approx 4\pi R^2\left(\frac{\tan K_0R}{K_0R} - 1\right)^2$$

This correctly predicts that when $\tan K_0R = K_0R$ the scattering cross section will be zero.
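
A minimal numerical sketch of the square-well results follows. It assumes units with $\hbar = \mu = 1$
by default and uses hypothetical function names. It evaluates
$\delta_0 = \tan^{-1}\left(\frac{K}{K_0}\tan K_0R\right) - KR$, the s-wave cross section $4\pi\sin^2\delta_0/K^2$, and the
slow-particle estimate $4\pi R^2(\tan K_0R/K_0R - 1)^2$. The phase shift is returned on the principal
branch of the arctangent, which does not affect the cross section.

```python
import math


def wavenumbers(E, V0, mu=1.0, hbar=1.0):
    """Return (K, K0): wavenumbers outside and inside a well of depth V0 at energy E."""
    K = math.sqrt(2.0 * mu * E) / hbar
    K0 = math.sqrt(2.0 * mu * (E + V0)) / hbar
    return K, K0


def s_wave_phase_shift(E, V0, R, mu=1.0, hbar=1.0):
    """delta_0 = atan((K/K0) tan(K0 R)) - K R, on the principal branch of atan."""
    K, K0 = wavenumbers(E, V0, mu, hbar)
    return math.atan((K / K0) * math.tan(K0 * R)) - K * R


def s_wave_cross_section(E, V0, R, mu=1.0, hbar=1.0):
    """sigma = 4 pi sin^2(delta_0) / K^2."""
    K, _ = wavenumbers(E, V0, mu, hbar)
    delta0 = s_wave_phase_shift(E, V0, R, mu, hbar)
    return 4.0 * math.pi * math.sin(delta0) ** 2 / K ** 2


def low_energy_cross_section(V0, R, mu=1.0, hbar=1.0):
    """Slow-particle estimate 4 pi R^2 (tan(K0 R)/(K0 R) - 1)^2 with K0 taken at E = 0."""
    K0 = math.sqrt(2.0 * mu * V0) / hbar
    return 4.0 * math.pi * R ** 2 * (math.tan(K0 * R) / (K0 * R) - 1.0) ** 2
```

For example, with $V_0 = 1$, $R = 1$ and $K = 0.01$ the exact and slow-particle cross sections agree
to better than one percent, and a depth chosen so that $\tan K_0R = K_0R$ gives a vanishing estimate.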

There are a few features of the square well which also apply in more general cases, assuming $K_0$
is basically a measure of the potential depth.

• For weak coupling $K_0R \ll 1$, $\delta_0(K) \to 0$ as $K \to 0$.

• When $K_0R$ approaches $\pi/2$ the potential is almost able to bind an s-wave bound state. Now
the phase shift $\delta_0(K) \to \pi/2$ and the cross section diverges like $K^{-2}$ as $K \to 0$. This is
known as zero energy resonance.

• If $E$ is high enough that $\delta_l = (n + \frac{1}{2})\pi$ for $l \neq 0$ the scattering cross section can become
especially high due to another angular momentum component: p-wave resonance for $l = 1$,
d-wave resonance for $l = 2$ etc. In these cases the eigenfunction becomes large near to the
potential. The potential is said to have virtual states at the resonance energies.

• Levinson's Theorem states that

$$\lim_{k \to 0}\delta_l(k) = n_l\pi$$

where $n_l$ is the number of bound states with angular momentum $l$.

• Whenever $\delta_0(K) = n\pi$, for s-wave scattering, $\sigma = 0$. Thus for certain energies of the
incoming particle, the scattering is extremely small. This condition can only be consistent
with the condition for s-wave scattering ($KR \ll 1$) if the potential is attractive ($V < 0$).

• $\delta_0(K)$ tends to decrease with increasing $K$. This can be understood physically as the faster
particles having less time to interact and thus experiencing smaller phase shifts. As $K \to \infty$,
$\delta_l(K) \to 0$ because the potential is now weak relative to the particle energy. Of course
$\sigma(K \to \infty)$ decreases even more quickly because of the $K^{-2}$ term.

14.5 Partial Waves in the Classical Limit - Hard Spheres 

Consider the scattering of a small hard sphere (radius $x_m$, mass $m$) by a large hard sphere ($X_M$,
$M$). Firstly we transform the problem to the centre of mass reference frame where it becomes
that of a single effective particle of mass $\mu = mM/(m + M)$ moving in a hard sphere potential
($V(r < r_H = X_M + x_m) = \infty$). Thus the boundary condition is $R_l(r_H) = 0$.

Consider the classical limit, where the sphere radius is much larger than the de Broglie wavelength,
$Kr_H \gg 1$. Up to $l = Kr_H$ the phase shift is enormous and $\sin\delta_l$ could have any value. For
$l > Kr_H$ the impact parameter is so large that the particles miss and $\delta_l = 0$. Thus we can write
the scattering cross section:

$$\sigma = \frac{4\pi}{K^2}\sum_{l=0}^{l=Kr_H}(2l+1)\frac{1}{2}$$

where we replace $\sin^2\delta_l$ with its average value of $\frac{1}{2}$.

Since $Kr_H$ is large, we can replace the sum by an integral and take only the leading term,
$(Kr_H)^2 \gg Kr_H$:

$$\sigma \approx \frac{2\pi}{K^2}\int_{l=0}^{l=Kr_H}(2l+1)\,dl \approx 2\pi r_H^2$$

This result should send us rushing back to look for the extra factor of 2, since the cross-section
of a sphere might be expected to be $\pi r_H^2$. In fact, though, the analysis is correct and closer
analysis of the $\theta$ dependence of the wavefunction shows that half the amplitude is diffracted into
the classical 'shadow' of the sphere to cancel the amplitude of the unscattered wave there.
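
The factor of 2 can be seen by evaluating the sum directly. The sketch below assumes, as in the
argument above, that $\sin^2\delta_l$ is replaced by $\frac{1}{2}$ for every $l \le Kr_H$ and by zero otherwise; the
function names are hypothetical.

```python
import math


def hard_sphere_cross_section(K, r_H):
    """(4 pi / K^2) * sum_{l=0}^{floor(K r_H)} (2l+1) * 1/2, i.e. sin^2(delta_l) -> 1/2."""
    l_max = int(K * r_H)
    partial_sum = sum((2 * l + 1) * 0.5 for l in range(l_max + 1))
    return 4.0 * math.pi / K ** 2 * partial_sum


def ratio_to_geometric(K, r_H):
    """Ratio of the averaged partial-wave result to the geometric area pi r_H^2."""
    return hard_sphere_cross_section(K, r_H) / (math.pi * r_H ** 2)
```

As $Kr_H$ grows the ratio returned by `ratio_to_geometric` tends to 2, with a correction of relative
size about $2/(Kr_H)$ from the sub-leading term.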



14.6 Ramsauer-Townsend effect 

This is the name given to the fact that electrons with energy about 1 eV can pass almost freely
through Xe, Kr, and Ar: there is a sharp minimum in electron scattering cross-section for these
noble gases.

Due to polarisation of these atoms by the incoming electron the potential appears to increase as
$k$ increases (more localised electrons are better able to polarise the atom). Thus $\delta_0(k \to 0) = n\pi$,
in accordance with Levinson's theorem, and $\delta_0$ initially increases as $k$ increases, before eventually
decreasing. Thus at a certain value of $k$, the phase shift is again $\delta_0(k) = n\pi$, and the total scattering
cross section $\sigma_T$ has an abrupt minimum. Although there are subsequent s-wave minima at e.g.
$\delta_0(k) = (n - 1)\pi$, these occur at sufficiently large values of $k$ that s-wave scattering is no longer
dominant.




Figure 15: Minimum in scattering cross section in Ar due to $\delta_0 = 3\pi$; no such effect in Ne due to
weaker polarisation.

By contrast, neon and helium have lower polarisability, due to fewer bound electrons. Thus the
phase shift $\delta_0$ decreases monotonically with $k$ from $n\pi$ at $k = 0$ and there is no low-energy minimum.

Higher $l$ phase shifts may increase with $k$ because higher $k$ implies smaller impact parameter
(classically, more chance of hitting the atom). The cross section increases more slowly due to the
additional $K^{-2}$ dependence. The maximum in the Ar cross section at about 13 eV is mainly due
to the d-wave $\delta_2 = \pi/2$.




Figure 16: More-localised electrons polarise atoms and thus increase the attractive potential 



15 Bits and pieces 

15.1 Casimir effect - forces from nothing 

For many quantum systems, such as the harmonic oscillator, there is still some energy associated 
with the lowest quantum state. This "zero-point" energy is real, and can be measured in the 
'Casimir effect'. There is a force between two metallic plates in a vacuum, because moving them 
would change the wavelength/energy of the zero-point quantised electromagnetic waves between 
them: this change in energy in response to a move equates to a force. 

The wavefunction for transverse standing electromagnetic waves between plates of area $A$ separated
by $a$ in the z-direction is:

$$\Phi_n = \exp[i(\mathbf{k}\cdot\mathbf{r} - \omega_n t)]\,\sin(k_n z)$$

where $\mathbf{k}$ lies in the xy plane and $k_n = n\pi/a$. The energy is $E_n = \hbar\omega_n = hc/\lambda = \hbar c\sqrt{k^2 + k_n^2}$

and the force per unit area is

$$F = -\frac{dE}{da} = -\frac{d}{da}\left(\hbar\int\!\!\int\sum_n \omega_n\,\frac{d^2k}{(2\pi)^2}\right) = -\frac{\hbar c\pi^2}{240a^4}$$

Solving this involves a trick of multiplying each term by $|\omega_n|^{-s}$, then taking the limit $s \to 0$.
This tiny attractive force has now been measured (Bressi, Phys. Rev. Letters, 2002).
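
To get a feel for the size of the effect, the closed-form result $F = -\hbar c\pi^2/(240a^4)$ can be
evaluated numerically. The sketch below assumes ideal, perfectly conducting parallel plates at zero
temperature and SI values of $\hbar$ and $c$; the function name is hypothetical.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant in J s
C = 299792458.0         # speed of light in m / s


def casimir_pressure(separation):
    """Force per unit area (Pa) between ideal plates: -hbar c pi^2 / (240 a^4)."""
    return -HBAR * C * math.pi ** 2 / (240.0 * separation ** 4)
```

For a separation of one micron this gives a pressure of roughly $-1.3 \times 10^{-3}$ Pa, and the $a^{-4}$
dependence means that halving the gap increases the force sixteen-fold.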



15.2 What does it mean: Wavefunction collapse and the EPR paradox 

The interpretation of collapsing wavefunctions is often regarded as unphysical, or philosophically 
problematic. There appears to be a contradiction with relativity in the idea that the wavefunction 
collapses instantaneously throughout space, although the wavefunction is not measurable. 

An attractive contrary view to the idea of 'measurement collapsing the wavefunction' is that for 
a particular system the value of an observable is a property of the particle, and the wavefunction
only expresses averages over many particles. This kind of property is known as a hidden variable. 
As we shall see, this interpretation of quantum mechanics can be tested, and is inconsistent with 
experimental results. 

Consider a two-photon decay from a source (e.g. $^{40}$Ca). Two polarisers are oriented along the
z-direction, and we detect whether or not the photons pass through the polariser.

The decay is one in which angular momentum is conserved, so the photons must be either both
right-polarised ($e_R$) or both left-polarised ($e_L$) (they travel in opposite directions). We are dealing
with bosons, so the wavefunction can be written as a superposition:

$$|12\rangle = \frac{1}{\sqrt{2}}\left(e_{1R}e_{2R} + e_{1L}e_{2L}\right)$$

Now convert into x and y polarisation using $e_R = \frac{1}{\sqrt{2}}(e_x - ie_y)$ and $e_L = \frac{1}{\sqrt{2}}(e_x + ie_y)$ (with the
sign of $i$ reversed for the second photon, which travels in the opposite direction) to give

$$|12\rangle = \frac{1}{\sqrt{2}}\left(e_{1x}e_{2x} + e_{1y}e_{2y}\right)$$

From this we can clearly see that the quantum probability of photon 1 passing through
its detector is $\frac{1}{2}$, and if so the wavefunction collapses onto $|12\rangle = e_{1x}e_{2x}$ and the conditional
probability of the second photon passing through its detector is then 1. Thus quantum mechanics
tells us that the probability of both detectors counting is $\frac{1}{2}$.

Contrariwise, a hidden variables argument might say that on production the photons were polarised
in a random direction, say $\theta$ to the x-axis. In this case the probability of passing through
either detector would be $\cos^2\theta$, and the probability of simultaneous counts will be $\langle\cos^4\theta\rangle = 3/8$.
The mathematics for particles with correlated spins is similar.

Since the wavefunction collapse and hidden variable approach give different answers, we can do 
an experiment to see which is correct. 
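
The two predictions for the coincidence probability can be compared with a few lines of code. The
sketch assumes both polarisers lie along x, a hidden polarisation angle uniformly distributed
over $[0, 2\pi)$, and independent passage through each detector with probability $\cos^2\theta$. The
function names are hypothetical.

```python
import math


def hidden_variable_coincidence(samples=3600):
    """Average of cos^4(theta) over a uniformly distributed hidden polarisation angle."""
    step = 2.0 * math.pi / samples
    total = sum(math.cos((i + 0.5) * step) ** 4 for i in range(samples))
    return total * step / (2.0 * math.pi)


def quantum_coincidence():
    """P(photon 1 passes) * P(photon 2 passes | photon 1 passed) after collapse."""
    probability_first = 0.5
    probability_second_given_first = 1.0
    return probability_first * probability_second_given_first
```

The hidden-variable average evaluates to 3/8, compared with 1/2 from wavefunction collapse.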

15.3 Hidden Variables: Bell's Inequality and Aspect's experiment 




Detector Analyser Source Analyser Detector 



Figure 17: Aspect's Experiment: The polarisations of both photons from the two-photon $^{40}$Ca
source are measured by analysers at angles of $\theta$ and $\phi$.

Consider extending the experiment described above to the case of analysers at arbitrary angles
which detect all photons. We define measurables $a(\theta)$ and $b(\phi)$ as +1 if the photon is aligned
with the analyser and -1 if it is opposed. What, then, is the ensemble average value of $P(\theta, \phi) =
\langle a(\theta)b(\phi)\rangle$? Clearly, if $a(\theta)$ and $b(\phi)$ are uncorrelated $P = 0$, but since they come from a common
source, this is not the case: their wavefunctions are sometimes referred to as 'entangled'.

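If the photons start out with 'hidden variable' polarisation $\chi$ then it is easily shown that:

$$P_{HV}(\theta, \phi) = \frac{1}{2\pi}\oint\left(\cos^2(\theta - \chi) - \sin^2(\theta - \chi)\right)\left(\cos^2(\phi - \chi) - \sin^2(\phi - \chi)\right)d\chi = \frac{1}{2}\cos 2(\theta - \phi)$$

Meanwhile if the wavefunction collapses at the first measurement, taken arbitrarily as A:

$$P_{QM}(\theta, \phi) = \frac{1}{2\pi}\oint\left(\cos^2(\theta - \chi) + \sin^2(\theta - \chi)\right)\left(\cos^2(\theta - \phi) - \sin^2(\theta - \phi)\right)d\chi = \cos 2(\theta - \phi)$$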

In 1982, to test this, Aspect carried out measurements on $^{40}$Ca decays using two different angles
for both $\theta$ and $\phi$. The quantity he evaluated was:

$$S(\theta_1, \phi_1, \theta_2, \phi_2) = P(\theta_1, \phi_1) + P(\theta_2, \phi_2) + P(\theta_2, \phi_1) - P(\theta_1, \phi_2)$$

where he chose the values which give the largest $S$: $\theta_1 = \phi_1 + \frac{\pi}{8} = \theta_2 + \frac{\pi}{4} = \phi_2 + \frac{3\pi}{8}$.

The hidden variables theory suggests the result should be $S = \sqrt{2}$, while the wavefunction collapse
suggests $S = 2\sqrt{2}$ with perfect measurement devices. Imperfections in the measurement will reduce
the measured correlation in each case. Aspect measured $S = 2.697 \pm 0.015$, confirming the quantum
prediction.
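
The two ideal values of $S$ can be reproduced numerically. The sketch below assumes perfect
analysers, the correlation functions $P_{HV} = \frac{1}{2}\cos 2(\theta - \phi)$ (obtained here by averaging over the
hidden angle $\chi$) and $P_{QM} = \cos 2(\theta - \phi)$, and analyser settings separated by $\pi/8$,
$\pi/4$ and $3\pi/8$. The names and the choice $\theta_1 = 0$ are illustrative.

```python
import math


def correlation_hidden_variable(theta, phi, samples=3600):
    """(1/2pi) * integral over chi of cos 2(theta - chi) * cos 2(phi - chi)."""
    step = 2.0 * math.pi / samples
    total = 0.0
    for i in range(samples):
        chi = (i + 0.5) * step
        total += math.cos(2.0 * (theta - chi)) * math.cos(2.0 * (phi - chi))
    return total * step / (2.0 * math.pi)


def correlation_quantum(theta, phi):
    """Correlation after collapse at the first analyser: cos 2(theta - phi)."""
    return math.cos(2.0 * (theta - phi))


def aspect_S(P, theta1, phi1, theta2, phi2):
    """S = P(theta1, phi1) + P(theta2, phi2) + P(theta2, phi1) - P(theta1, phi2)."""
    return P(theta1, phi1) + P(theta2, phi2) + P(theta2, phi1) - P(theta1, phi2)


# Settings satisfying theta1 = phi1 + pi/8 = theta2 + pi/4 = phi2 + 3pi/8.
THETA1 = 0.0
PHI1 = THETA1 - math.pi / 8.0
THETA2 = THETA1 - math.pi / 4.0
PHI2 = THETA1 - 3.0 * math.pi / 8.0

S_QUANTUM = aspect_S(correlation_quantum, THETA1, PHI1, THETA2, PHI2)
S_HIDDEN = aspect_S(correlation_hidden_variable, THETA1, PHI1, THETA2, PHI2)
```

With these settings `S_QUANTUM` is $2\sqrt{2}$ and `S_HIDDEN` is $\sqrt{2}$, so the measured value
of 2.697 lies well above the hidden-variable figure.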



The apparent complexity of Aspect's experiment is needed to eliminate sources of error due to 
detector, analyser and source imperfections. 

There is an apparent contradiction between quantum mechanics and relativity, in that the
interpretation of quantum mechanics requires instantaneous collapse of the wavefunction. However,
there is no measurable quantity for which the two theories give different predictions. "Teleportation"
can transport a quantum state arbitrary distances, but it doesn't transfer information
instantaneously.

Most of the wavefunctions we have solved are from Schroedinger's equation, which treats time and
space in different ways. For a properly relativistic approach, they should be equivalent. This
discrepancy between quantum mechanics and relativity is easily resolved: the Dirac equation provides
a fully relativistic wave equation for which the Schroedinger equation is a low energy approximation. A
nice thing about the Dirac equation is that it can only be solved by spinors: as with quantisation, the
observed physics turns out to be the only way to solve the mathematics.

The three original papers described in this section are beautifully clear; copies are linked from the
course webpage.

15.4 When can things interfere? What counts as a measurement? 

Interference from two slits of a single particle with itself remains a difficult concept to understand. 



Figure 18: Feynman's 'classical' explanation of the destruction of the interference pattern by 
measurement, and two separate demonstrations that it is really a quantum effect 

Feynman introduced a nice argument based on the uncertainty principle. He argued that the
wavelength of light required to detect which way a particle went must be smaller than the slit 
separation. From the uncertainty principle, it follows that the momentum transfer must be so 
large that it would destroy the interference pattern. Thus the measurement device destroyed the 
interference. Unfortunately, more recent experiments show things are more complicated than that. 

Eichmann et al (Phys. Rev. Lett, 1993) set up a 'two slit' experiment using photons, with lead atoms
as the scatterers. With careful choice of energy, they were able to arrange that the scattering event
changed the internal electronic state of the atom: a process which requires negligible momentum
transfer but would allow subsequent measurement of the atomic state and determination of which
way the particle went. As a consequence, the interference fringes vanish.

Dürr et al (Nature, 1998) used a standing light wave to scatter rubidium atoms. Added to this
was a microwave source which changed the hyperfine state of the atoms at one of the "slits", 
which could in principle be measured but supplies negligible momentum. The interference pattern 
disappeared. 

Again, quantum mechanics has been shown to give a correct description: non-identical wavefunc- 
tions do not interfere even if they describe the same particle! It does not matter whether the 
measurement of the internal states is actually performed: the mere fact that it could be is enough 
to destroy the interference. 







15.5 Relativistic Quantum Mechanics 



The Schroedinger equation itself is clearly inconsistent with relativity: it has second derivatives
of space, and first derivatives of time. If we use the relativistic expression for energy
$E^2 = |p|^2c^2 + m^2c^4$ we obtain

$$-\hbar^2\frac{\partial^2}{\partial t^2}\phi(\mathbf{r},t) = -\hbar^2c^2\nabla^2\phi(\mathbf{r},t) + m^2c^4\phi(\mathbf{r},t)$$

which is called the Klein Gordon equation. It has solutions describing a relativistic quantum
particle, but others which describe particles of negative total energy, together with negative
probabilities for finding them! Applied to hydrogen it reproduces the relativistic kinetic energy
correction, but it doesn't account for other observed relativistic effects, such as the spin-orbit
correction or the Darwin term (see Atomic and molecular physics).

Dirac tried keeping time and space on an equal footing using a linear equation 



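$$i\hbar\frac{\partial}{\partial t}\psi(\mathbf{r},t) = \left[c\,\boldsymbol{\alpha}\cdot\mathbf{p} + \beta mc^2\right]\psi(\mathbf{r},t) = H\psi(\mathbf{r},t) \quad\text{where}\quad \boldsymbol{\alpha}\cdot\mathbf{p} = -i\hbar\left(\alpha_x\frac{\partial}{\partial x} + \alpha_y\frac{\partial}{\partial y} + \alpha_z\frac{\partial}{\partial z}\right)$$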



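Consider a free particle: no terms in the Hamiltonian $H$ should depend on $\mathbf{r}$ or $t$ as these would
describe forces. Dirac assumed that $\alpha_i$ and $\beta$ are independent of position, time, momentum and
energy, so $\alpha_i$ and $\beta$ commute with $\mathbf{r}$, $t$, $\mathbf{p}$ and $E$ but not necessarily with each other.

Since relativistic invariance must be maintained, i.e. $E^2 = |p|^2c^2 + m^2c^4$,

$$H^2\psi(\mathbf{r},t) = \left(c^2|p|^2 + m^2c^4\right)\psi(\mathbf{r},t) = \left[c\,\boldsymbol{\alpha}\cdot\mathbf{p} + \beta mc^2\right]\left[c\,\boldsymbol{\alpha}\cdot\mathbf{p} + \beta mc^2\right]\psi(\mathbf{r},t)$$

Expand the RHS of this equation, being very careful about the ordering of $\alpha_i$ and $\beta$:

$$\begin{aligned}
H^2\psi(\mathbf{r},t) &= \left\{c^2\left[(\alpha_x)^2(p_x)^2 + (\alpha_y)^2(p_y)^2 + (\alpha_z)^2(p_z)^2\right] + m^2c^4\beta^2\right\}\psi(\mathbf{r},t) \\
&+ c^2\left\{(\alpha_x\alpha_y + \alpha_y\alpha_x)p_xp_y + (\alpha_y\alpha_z + \alpha_z\alpha_y)p_yp_z + (\alpha_z\alpha_x + \alpha_x\alpha_z)p_zp_x\right\}\psi(\mathbf{r},t) \\
&+ mc^3\left\{(\alpha_x\beta + \beta\alpha_x)p_x + (\alpha_y\beta + \beta\alpha_y)p_y + (\alpha_z\beta + \beta\alpha_z)p_z\right\}\psi(\mathbf{r},t)
\end{aligned}$$

Relativistic invariance for the free particle requires that the second and third terms are zero, and
so

$$(\alpha_x)^2 = (\alpha_y)^2 = (\alpha_z)^2 = \beta^2 = 1$$

$$\alpha_i\alpha_j + \alpha_j\alpha_i = 0 \quad (i \neq j)$$

$$\alpha_x\beta + \beta\alpha_x = 0 \quad (\text{and similarly for } y, z)$$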



Thus $\alpha_i$ and $\beta$ cannot be just numbers. The simplest representation for $\alpha_i$ and $\beta$ is as 4x4 matrices,
meaning that the wavefunction is a 4-component vector. When we work this through, there are no
negative probabilities, but two of the components turn out to have negative energy. Full details
of the derivation are on the course website.
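
The algebraic conditions on $\alpha_i$ and $\beta$ can be checked explicitly for one concrete choice of
4x4 matrices. The sketch below assumes the standard Dirac representation, with $\alpha_i$ built from
the Pauli matrices in the off-diagonal blocks and $\beta = \mathrm{diag}(1, 1, -1, -1)$; this is one valid
choice, not necessarily the one used in the course derivation, and the helper names are
hypothetical.

```python
def matmul(A, B):
    """Product of two square matrices stored as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def anticommutator(A, B):
    """Return AB + BA for two square matrices of equal size."""
    AB, BA = matmul(A, B), matmul(B, A)
    n = len(A)
    return [[AB[i][j] + BA[i][j] for j in range(n)] for i in range(n)]


def block_off_diagonal(s):
    """Build the 4x4 matrix [[0, s], [s, 0]] from a 2x2 matrix s."""
    return [
        [0, 0, s[0][0], s[0][1]],
        [0, 0, s[1][0], s[1][1]],
        [s[0][0], s[0][1], 0, 0],
        [s[1][0], s[1][1], 0, 0],
    ]


# Pauli matrices and the Dirac-representation alpha and beta (assumed representation).
SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]
SIGMA_Z = [[1, 0], [0, -1]]
ALPHA = [block_off_diagonal(s) for s in (SIGMA_X, SIGMA_Y, SIGMA_Z)]
BETA = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

IDENTITY = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
ZERO = [[0] * 4 for _ in range(4)]
```

With these matrices each $\alpha_i^2$ and $\beta^2$ equals the identity, while every pair of distinct
matrices anticommutes, which is exactly the set of conditions that plain numbers cannot satisfy.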

It turns out that the four components accurately describe the two spin states of the electron 
and the positron. More remarkably, Dirac solved the equation before the positron had even been 
discovered! 



